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ELECTRICAL STIMULATION OF THE BRAIN AND THE 
PSYCHOPHYSIOLOGY OF LEARNING 
AND MOTIVATION! 
H. PHILIP ZEIGLER? 
Departments of Psychology and Neurophysiology,* University of Wisconsin 


As a physiological technique, elec- 
trical stimulation of the brain dates 
from the classic studies of Fritsch and 
Hitzig (24) on the motor cortex of 
the dog. The use of this technique in 
behavioral research, however, is a 
more recent development. Since the 
pioneer studies of Loucks (62, 63, 
64), psychologists have evinced spo- 
radic interest in electrical stimulation 
of the brain as an important research 
tool in elucidating the neural mecha- 
nisms underlying behavior. Recent 
findings in neurophysiology, and ad- 
vances in electrophysiological meth- 
odology, have led to a-resurgence of 
this interest. Hess’s studies of the 
diencephalon (41, 42, 43, 44, 45, 46), 


1 The following abbreviations have been 
used in this review: CER (Conditioned 
Emotional Response), CFR (Conditioned 
Fear Response), CM (Centrum Medianum 
nucleus of the thalamus), CNS (Central 
Nervous System), CR (Conditioned Re- 
sponse), CS (Conditioned Stimulus), EEG 
(Electroencephalograph), GSR (Galvanic Skin 
Response), S-S (Sensory-Sensory), S-R (Stim- 
ulus Response), UCS (Unconditioned Stimu- 
lus), VM (Ventro Medialis nucleus of the 
thalamus), VPL (Ventro-Postero-Lateralis 
nucleus of the thalamus 

2 The author wishes to express his gratitude 
to Prof. W. J. Brogden, Department of Psy- 
chology, and to Drs. K. Akert, R. Benjamin, 
and W. Welker, Department of Neurophysi- 
ology, University of Wisconsin, for 
encouragement and helpful criticisms. 

* This review was completed during the 
author’s tenure as Research Fellow, National 
Institute of Mental Health. 
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Magoun's work on reticular activat- 
ing systems (70), and studies of sen- 
sory and motor representation by 
Woolsey (101) have been of particu- 
lar interest to psychologists. More- 
over, some of the techniques de- 
veloped in the course of this research 
have been adapted for use with hu- 
man subjects. The studies of Penfield 
and his associates, on patients under- 
going neurosurgery, are classic ex- 
amples of the application of these 
techniques to the investigation of 
human brain function (88). The re- 
sults so obtained have confirmed and 
extended the findings on cortical 
localization obtained from lower ani- 
mals. Studies employing stimulation 
of the human brain through elec- 
trodes implanted in various subcorti- 
cal structures are currently in pro- 
gress (37, 38). 

The investigations cited above 
have provided a wealth of factual ma- 
terial regarding neurophysiological 
processes, but the relation of these 
processes to behavior remains ob- 
scure. As Teuber has pointed out, 
“The absence of any convincing 
physiological correlate of learning is 
the greatest gap in physiological psy- 
chology” (99). Electrical stimulation 
of the brain is among the most prom- 
ising of the available techniques for 
establishing such correlates. Its 
greatest advantage over cerebral ab- 
lation is that it makes possible the 
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study of the intact nervous system in 
the unanesthetized animal under con- 
ditions closely approximating those 
in which the learning process occurs. 

This paper is primarily a review of 
research in which electrical stimula- 
tion of the central nervous system is 
employed in the investigation of 
neural mechanisms involved in the 
learning process. Studies of electro- 
convulsive shock (ECS) and learning 
have been omitted, because they em- 
ploy electrical stimulation in a mark- 
edlydifferent manner. Although prob- 
lems of electrophysiological meth- 
odology will be briefly discussed 
wherever relevant, this paper is not 
primarily concerned with such prob- 
lems. Readers interested in these 
problems are referred to publications 
of Hess (41, 46), Delgado (17, 19), 
Lilly (54, 55), Heath (37), Monnier 
and Laue (75), etc., and to a recent 
volume on brain stimulation edited 


by Sheer (94). 
INTRANEURAL CONDITIONING 


In reviewing a series of studies de- 
signed to determine the neural cor- 
relates of the conditioned response, 
Gantt (27) suggested the terms intra- 
neural and extraneural conditioning. 
As opposed to ordinary (extraneural) 
conditioning, in which the UCS and 
CS arise outside the central nervous 
system, intraneural conditioning re- 
quires that the UCS and/or CS be 
produced by direct stimulation of the 
CNS, and that a CR be established 
under conditions precluding the pos- 
sibility of artifactual stimulation. 
The pioneer studies in this area, those 
of Loucks, Gantt, and their collabor- 
ators, were made feasible by the de- 
velopment of Loucks’ technique for 
brain stimulation of chronic prepara- 
tions (61). The technique utilizes a 
collodion covered coil, buried just 
beneath the skin, from which insu- 
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lated wires lead to an implanted elec- 
trode. When the primary of an in- 
ductorium is laid over the skin, adja- 
cent to the coil, the induced current 
is conducted to the electrodes. It was 
thus possible to substitute direct 
CNS stimulation for the CS or UCS 
in a conditioning experiment. (Be- 
cause of the possibilities of artifactual 
stimulation (13) and certain problems 
of parametric control, this technique 
is no longer in general use.) 

The results of these early studies 
of intraneural conditioning in dogs 
may be summarized as follows: 

1. Stimulation of the motor cortex 
(UCS) paired with an externally pre- 
sented CS did not produce condition- 
ing. Presentation of food following 
each trial resulted in effective condi- 
tioning (62). 

2. Stimulation of the visual cortex 
(CS) paired with an externally pre- 
sented UCS resulted in the formation 
of conditioned responses (63). 

3. Stimulation of the dorsal col- 
umns of the spinal cord (UCS) paired 
with an externally presented CS was 
ineffective in establishing condition- 
ing. However, stimulation of the dor- 
sal roots (UCS), or an increase in the 
intensity of dorsal column stimula- 
tion resulted in conditioning (64). 

4. Responses produced by stimula- 
tion of the vestibular ganglion (UCS) 
could be conditioned to a variety of 
external CS’s (66). 

5. Stimulation of the cerebellum 
(UCS) produced a variety of respon- 
ses, some of which could be condi- 
tioned to externally presented CS's 
(11). 

The studies themselves have been 
reviewed in detail elsewhere (34, 
76). Our concern in the present dis- 
cussion, therefore, will be with the 
use which has been made of these 
findings as experimental support for 
certain theories of learning. 
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The results of two of the experi- 
ments (62, 64) have been cited by 
certain theorists as being relevant to 
one or both of two major theoretical 
issues. The first of these concerns the 
role of drive reduction in learning; 
the second raises the general problem 
of whether the S-S or S-R paradigm 
more adequately describes the learn- 
ing process. Spence (97) has cited the 
study pairing stimulation of the 
motor cortex with external CS’s as 
support for the drive-reduction view- 
point, since no conditioned responses 
were established until food was given 
following each trial (62). Supporters 
of an S-S paradigm have attributed 
the failure of conditioning in 
study to an absence of ‘contiguity 
of afferent processes” (4,71). That is, 
the assumption is made that the UCR 
was produced directly through motor 
stimulation, omitting the afferent in- 
put normally provided by an exter- 
nally presented UCS. The experi- 
ment on stimulation of spinal col- 
umns vs. roots has been similarly dis- 
Spence (97) cites this study 
as evidence for a drive-reduction 
viewpoint, on the assumption that 
there was no conditioning until stimu- 


this 


cussed. 


lation was intense enough to spread to 
adjacent pain fibers. Proponents of 
an S-S paradigm, however, have at- 
tributed this failure to 
tive 


‘lack of effec- 
relationships of the 
UCS” (4), in a manner similar to 
their interpretation of the motor 
stimulation experiment. None of 
these theorists has discussed the re- 
sults of intraneural conditioning 
studies involving cerebellar stimula- 
tion (11), although in this case condi- 
tioning was obtained in some ani- 
mals, despite by-passing of an exter- 
nal UCS, and absence of external rein- 
forcement. 

The relevance to learning theory 
of interpretations such as these de- 
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pends, in part, upon the validity of 
two implicit assumptions. It must 
first be demonstrated that we are 
dealing with intraneural and not ex- 
traneural conditioning; that is, the 
CS or UCS must be produced by elec- 
trical stimulation of the central nerv- 
ous system. It is probable that this 
assumption is not tenable for either 
experiment. In the study which em- 
ployed motor stimulation as_ the 
UCS and found no conditioning until 
food was presented following every 
trial (62), there is evidence that the 


animals were selectively learning 
through instrumental reward pro- 
cedures. Loucks attempted to test 


this hypothesis by presenting the 
food randomly, rather than following 
every trial. The results of this con- 
trol procedure supported the hypo- 
thesis of instrumental reward learn- 
ing, since with this modified pro- 
cedure there was complete failure to 
establish conditioned responses (62). 
The experiment, involving 
stimulation of the spinal cord, is also 
not a clear-cut case of intraneural 
conditioning. No conditioned re- 
sponses were obtained until the in- 
crease in stimulus intensity resulted 
in a “rough caudal jerk of the leg’”’ 
(64). Loucks points out that it was 
this rough caudal leg jerk, with its 
consequent excitation of nociceptive 
receptors, which was the actual UCS. 
That is, the UCS was peripheral, not 
central stimulation. 

The relation of these experiments 
to the S-S, S-R controversy rests on 
the assumption that the experiments 
have effectively separated the sensory 
and motor components of the condi- 
tioning process. In the light of cur- 
rent knowledge of the cortical repre- 
sentation of sensory and motor func- 
tion, this assumption is untenable. 
There is considerable evidence which 
indicates that the clear-cut distinc- 
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tion between cortical sensory and 
motor areas can no longer be main- 
tained. Sensory representation in the 
motor cortex (72, 100) and the cere- 
bellum (96) has been demonstrated. 
Moreover, the results of recent in- 
vestigations ‘‘suggest that motor 
functions are distributed throughout 
the unanesthetized cerebral cortex in 
the normal state, and [these results] 
imply, when correlated with sensory 
maps, that each small area of the cor- 
tex is truly ‘sensori-motor’, with a 
preponderance of one or the other 
function’’ (53). 

One final point is worthy of note. 
None of the hypotheses (S-S vs. S-R, 
drive reduction vs. sensory integra- 
tion) have been formulated in a man- 
ner that would make them accessible 
to neurophysiological verification. 


For this reason, as well as those dis- 
cussed above, the use of these experi- 
ments as support for any given learn- 
ing theory is probably unjustified. 


However, these early studies have 
played an important role in stimulat- 
ing further research in this area. 

In the period following the experi- 
ments of Loucks, Gantt, and their 
associates, work on intraneural condi- 
tioning has continued. ‘Reports by 
Culler (15) and Giurgea (35) exist 
only in abstract form, and have 
never been satisfactorily confirmed. 
Loucks has recently begun a program 
of intraneural conditioning studies on 
the domestic hen (65). Conditioned 
responses are established to electrical 
stimulation of the brain (CS) and the 
effects of implanting metal barriers 
in the vicinity of the electrodes are 
then investigated. The results indi- 
cate that barriers perpendicular to 
the surface of the brain appear to 
have negligible effects upon the es- 
tablishment or retention of 
tioned responses. 

In a paper concerned primarily 
with methodological problems in in- 


condi- 


H. PHILIP ZEIGLER 


traneural conditioning, Doty et al. 
report such conditioning in the cat 
(23). These investigators found that 
conditioned responses may be estab- 
lished to stimulation of the dura 
alone and have developed procedures 
for the control or elimination of ex- 
tracortical stimulation arising from 
afferents in the intracranial vascular 
system and the dura. Control of such 
artifactual CR’s is made feasible by 
the fact that extracortical stimula- 
tion of this nature can generally be 
detected by changes in GSR. Such 
artifacts are abolished by bilateral 
trigeminal neurotomy, which denerv- 
ates the pertinent region but does 
not interfere with the formation or 
retention of CR’s established solely 
to cortical stimulation. Pupillary 
dilatation, a common response to 
brain stimulation, and a_ possible 
source of extracortical propriocep- 
tives, may be controlled by the ap- 
plication of mydriatics to the eye. 
Doty’s results indicate that condi- 
tioned leg flexions may be established 
when stimulation of points in the 
marginal, postlateral, suprasylvian, 
or ectosylvian gyri serves as the CS. 
(These areas include the primary vis- 
ual and auditory cortex, and stimula- 
tion of their homologues in man elicits 
visual and auditory sensations [88].) 
CR’'s once established may be main- 
tained over periods up to _ three 
months following suspension of train- 
ing. It is difficult to interpret the re- 
sults on differentiation, generaliza- 
tion, and the differential effectiveness 
of cortical versus peripheral CS's be- 
cause of the extreme individual vari- 
ability reported. (The criterion used, 
15 out of 25, seems low in view of such 
high variability.) Some preliminary 
results on the role of subcortical 
structures in intraneural conditioning 
have been reported (93). These find- 
ings suggest that undercutting of the 
stimulated areas may produce severe 
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deficits in established CR’'s. Circum- 
section or isolation of such areas ap- 
pears to have negligible effects upon 
established CR’s. 

These results, though preliminary, 
indicate that electrical stimulation 
of the brain may be of considerable 
value in research on the neural mech- 
anisms involved in 
response. 


the conditioned 


NEURAL MECHANISMS IN LEARNING 
AND MOTIVATION 


Perhaps the most noticeable trend 
in current research on brain stimula- 
tion and learning is the shift in em- 
phasis from cortical to subcortical 
structures. Although the importance 
of these structures has long been rec- 
ognized, certain recent findings in 
neurophysiology have been of partic- 
ular interest to physiological psy- 
chologists. Studies of diencephalic 
and reticular “activating systems” 


suggest that these ‘‘systems”’ play an 


important role in the regulation of 


cortical activity. Stimulation of cer- 
tain diencephalic and mesencephalic 
structures in intact animals elicits an 
“alert’’ EEG, and behavior which has 
been described as “apprehension” or 
“‘alertness’’ (47, 70). Lesions in these 
structures result in an EEG charac- 
teristic of sleep, and a striking paucity 
of behavior (57, 58). Stimulation of 
other structures in these regions, and 
in the rhinencephalon, elicits a num- 
ber of highiy integrated behavior pat- 
terns as well as a wide variety of com- 
plex somatic and autonomic re- 
sponses (51, 69). For a general review 
of this research, the reader is referred 
to papers by Lindsley (60), Teuber 
(99), Gloor (36), Magoun (70), 
Jasper (48, 50), and Stellar (98), and 
to the UNESCO 
“Brain Mechanisms and Conscious- 
(16). 

On the assumption that the inter- 
action of certain cortical and subcor- 


symposium on 
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tical mechanisms may underlie such 
psychological phenomena as_ con- 
sciousness, emotion, motivation, at- 
tention, ete., several theorists have 
speculated upon the implications of 
these physiological findings for be- 
havior theory (39, 40, 59, 87). The 
studies to be reviewed below are 
based upon hypotheses suggested by 
these findings. 


REINFORCEMENT PRODUCED BY 
ELECTRICAL STIMULATION OF 
THE BRAIN 

Negative Reinforcement 

Over a period of 30 years, Hess 
(41, 42, 43, 44, 45, 46) has used brain 
stimulation of unanesthetized, intact 
animals to map the entire dienceph- 
alon, recording photographically the 
responses elicited, and verifying the 
electrode placements histologically. 
In addition to affecting such func- 
tions as circulation, respiration, tem- 
perature regulation, and digestion 
(36, 90), stimulation of this region 
may also elicit integrated behavior 
patterns. One of these patterns, the 
“‘affective-defense reaction” (42), has 
long been familiar to psychologists as 
the ‘‘sham-rage’”’ or ‘“‘pseudo-affec- 
tive’ reaction of the Cannon-Bard 
theory of emotion (2, 3). This pat- 
tern may be elicited by electrical 
stimulation of a zone centered in the 
perifornical region of the hypothala- 
mus and extending rostrally to the 
medial preoptic area and the base of 
the septum (36). Hess has described 
the resulting behavior as follows: 

“All the phenomena of growling, 
ear retraction, unsheathed claws, 
tail lashing, ete; are 
At this stage of general ex- 
citement, a slight movement on the 
part of the observer is enough to 
make him the object of a brisk and 
well directed attack. ... At the end 
of the stimulation, there is a rapid 
breakdown of the syndrome; even 


piloerection, 
present. 
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the most impulsive attack is stopped, 
and the animal is, as a rule, willing 
to be petted. But the threshold for 
angry behavior is maintained at a 
low level for a longer period” (45). 
In discussing the differentiation 
which many authors have made be- 
tween “normal” and ‘‘sham-rage”’ 
reactions, Hess notes: 

“This differentiation does not ap- 
ply to our observations, since the re- 
actions of the animals during hypo- 
thalamic stimulation differed in no 
way from those observed by us un- 
der normal environmental condi- 
tions. ... Our observations... 
are based on criteria identical with 
those conventionally applied to the 
phenomena of expressive mecha- 
nisms’’ (45). 

For a time, observations of emo- 
tional behavior elicited by hypothala- 
mic stimulation were used to support 
the concept that the hypothalamus 
was a “‘center’’ for emotion. How- 


ever, as the conceptual pendulum 


swung from the extremes of strict 
localization theory to the extremes of 
field theory, many investigators be- 
gan to question the meaningfulness 
and utility of the general concept of 
“centers.”’ In an attempt to evaluate 
the behavioral status of some of the 
responses produced by hypothalamic 
stimulation, and to clarify the role of 
this structure in emotional behavior, 
several conditioning and _ learning 
studies have been carried out. The 
first of these experiments was per- 
formed by Masserman, in an at- 
tempt to demonstrate the inadequacy 
of the concept that the hypothalamus 
was the “‘center’’ for emotional be- 
havior. It was Masserman's conten- 
tion that hypothalamic stimulation 
produced only the external manifes- 
tations of anger, and not ‘‘a true af- 
fective experience, which would be 
anticipated at the sensory signal and 
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for which the animal could learn to 
prepare, compensate, or adapt”’ (73). 

After demonstrating that cats could 
learn an escape or avoidance response 
to peripheral shock, Masserman (73) 
attempted to condition emotional 
responses elicited by hypothalamic 
stimulation. A variety of CS’s were 
used, and paired with hypothalamic 
stimulation. However, though the 
characteristic ‘‘emotionality’’ invari- 
ably followed such stimulation, the 
results indicated no escape or avoid- 
ance conditioning to the CS. Masser- 
man interpreted these results as indi- 
cating that ‘direct stimulation of the 
hypothalamus induces dramatic mi- 
metic-emotional effects, but these are 
not accompanied by the experiential 
or conative components of true rage 
or fear’ (73). 

A number of factors mitigate 
against the acceptance of such an 
interpretation. Precise control of 
electrode localization and stimula- 
tion parameters was lacking. In an 
area such as the hypothalamus, with 
its complex arrangement of nuclei in 
close proximity, minute variations 
in locus or stimulation parameters 
may result in markedly different 
response patterns. Indeed, Masser- 
man's description of the behavior of 
his animals differs in a number of 
respects from the observations of 
Hess. Masserman reports that the 
behavior was maladaptive, was not 
specifically directed against objects in 
the environment, had little effect 
upon ongoing behavior, and showed 
little or no emotional aftereffects 
when hypothalamic stimulation was 
terminated. These differences sug- 
gest that Masserman may not have 
been dealing with the total ‘‘affec- 
tive-defense reaction’ as Hess has 
described it. It is also possible that 
failure to achieve successful avoid- 
ance conditioning was due to the fact 
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that the experimental situation did 
not permit the animals to establish 
a specific avoidance or escape re- 
sponse during hypothalamic stimula- 
tion. In instrumental avoidance con- 
ditioning, such a response is followed 
by termination of the shock. Lacking 
pretraining, Masserman’s animals 
possessed no specific instrumental re- 
sponse. 

Recent studies of avoidance condi- 
tioning to hypothalamic stimulation 
suggest that the latter factor may 
have been, in part, responsible for 
Masserman’s negative results. Using 
a shuttle box, Cohen, Brown, and 
Brown (14) conditioned an escape 
response (jumping the barrier) to 
hypothalamic stimulation. Within 
200 trials, this behavior was developed 
into a consistent avoidance response 
to a previously neutral CS. These 
investigators interpret their results 
in drive-reduction terms; they view 
hypothalamic stimulation as a drive- 
arousing or energizing stimulus, whose 
termination produces the reinforce- 
ment for learning. It is their conten- 
tion that the behavior elicited when 


hypothalamic stimulation is used as 
the UCS in a learning situation would 
depend upon the characteristics of 


That is, 
and perhaps 
other types of learning could be dem- 


the experimental situation. 
avoidance, approach, 
onstrated in the same animal using 
identical localization 
stimulus parameters. By 
with hypotheses formulated in terms 
of “the experiential or conative com- 
(73), 


the Cohen and Brown hypothesis is 


electrode and 


contrast 


ponents of true rage and fear”’ 


accessible to experimental verifica- 
tion. 

Another group of studies of instru- 
mental avoidance learning motivated 
by brain stimulation 


is based upon 


the work of Delgado. This investi- 


gator has been concerned with the 
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elucidation of the ‘cerebral struc- 
tures involved in the transmission 
and elaboration of noxious stimula- 
tion’ (20). Delgado’s observations 
were made on chronic preparations, 
stimulated through electrodes in vari- 
ous subcortical and cortical struc- 
tures. Stimulation of the tegmentum, 
VPL nucleus of the thalamus, amyg- 
dala, hippocampus, crus of the fornix, 
and periacqueductal gray elicits 
response patterns characteristic of 
pain and/or emotion. These re- 
sponses include a variety of offensive 
and defensive behaviors, self-inspec- 
tion, vocalization, and autonomic re- 
actions; certain of them appear to 
form an integrated syndrome which 
Delgado calls ‘‘conditioned anxiety” 
(20). This syndrome is characterized 
by ‘“‘anxiety’’ responses to the obser- 
vation platforms on which the ani- 
mals have previously been stimu- 
lated, and a “permanent state of dis- 
trust’’ manifested toward observers 
or other animals present during such 
stimulation. Observations such as 
these led to experiments designed to 
determine whether this type of cen- 
tral stimulation could be conditioned, 
and used to motivate escape and 
avoidance behavior. 

It was found (18) that cats that 
had previously learned a wheel-turn- 
ing avoidance response to shock and 
CS, would continue to perform this 
response when brain stimulation was 
substituted for the CS. This re- 
sponse, in turn, was itself easily con- 
ditioned to a previously neutral CS. 
Similarly, an emotional disturbance 
produced by central stimulation could 
be conditioned to one of two distinc- 
tive compartments, and could be 
used as the UCS to motivate escape 
from the ‘“‘frightening’’ into the 
“safe’’ compartment. Electrical stim- 
ulation of the brain was also found to 
be an effective punishment in condi- 
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tioning hungry cats to avoid food. A 
recent study on the monkey (22) indi- 
cates that such stimulation is an ef- 
fective substitute for a CS in reinforc- 
ing instrumental shock-avoidance re- 
sponses previously conditioned to the 
CS. Monkeys were trained to avoid 
peripheral shock by overturning the 
left cup to a CS of a high tone. Turn- 
ing over the right cup in response to a 
low tone or no tone was reinforced 
with food. Brain stimulation was 
then administered during the no- 
tone trials, and it was found that such 
stimulation elicited avoidance re- 
sponses previously conditioned to 
high tone and shock. These responses 
are characterized as Conditioned 
Fear Responses (CFR). The authors 
advance the interpretation that ‘‘elec- 
trical stimulation .. . induces in the 
animal a condition similar to that 
which is present when it is anxious or 
afraid of being hurt’’ (22). 

Such an interpretation, as the 
authors themselves admit, is only one 


of a number of possible interpreta- 


tions. The most plausible of these 
alternate interpretations is that the 
stimulation was exciting pain fibers 
and therefore ‘‘elicited a pain-escape 
rather than a_ pain-avoidance re- 
sponse, (CFR)” (22). There is a con- 
siderable body of anatomical, physio- 
logical, and behavioral evidence which 
supports such an_ interpretation. 
Many of the structures stimulated 
are related to sensory fibers mediat- 
ing pain, through their connections 
with the trigeminal nerve, the spino- 
thalamic tract, and the VPL nucleus 
of the thalamus. The observations 
on protective movements, self-inspec- 
tion and vocalization suggest the pos- 
sibility of noxious stimulation, as 
does the fact that these responses are 
considerably reduced by anesthesia 
(20). Furthermore, the ‘anxiety’ 
and “permanent state of distrust’’ 
observed by Delgado in cats and 
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monkeys undergoing central stimula- 
tion has also been observed in rats 
subjected to prolonged or intensive 
peripheral shock (78). These animals 
display considerable aggression, di- 
rected toward observers, other ani- 
mals, or portions of their cage. In 
short, there is reason to believe that 
in the case of many of the structures 
involved, electrical stimulation is 
equivalent to a repetition of the orig- 
inal, painful, UCS; we are thus deal- 
ing with instrumental escape, rather 
than instrumental avoidance condi- 
tioning. 

Such an hypothesis will account 
for most of the observed results. 
However, stimulation of the hippo- 
campus, the amygdala, the fornix, 
and the periacqueductal gray also 
served to reinforce learning, without 
concurrently eliciting responses char- 
acteristic of pain. Information on 
the anatomical and functional rela- 
tionships of these structures is scanty. 
It is known that neurons of the mes- 
encephalic root of the trigeminus 
border on the periacqueductal gray 
and might have been affected by 
stimulus spread. Furthermore, sev- 
eral investigators have recorded elec- 
trical responses in the hippocampal 
region, following noxious stimulation 
of unanesthetized animals (68), and 
“fear’’ responses have been reported 
to stimulation of the amygdala and 
hippocampus (51). The evidence 
from human observations is com- 
pletely contradictory. According to 
Jasper (49), Penfield has never elic- 
ited anything resembling an emo- 
tional response to stimulation of the 
amygdala, the hippocampus or the 
hippocampal gyrus. Heath (38), on 
the other hand, reports the follow- 
ing observations on patients stimu- 
lated through electrodes implanted 
in these regions: 

“Stimulation of the amygdaloid 
nucleus resulted in intense emotional 
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reactions which varied from 
stimulation to the next, though pa- 
rameters of stimulation were kept 
constant. Sometimes this stimula- 
tion produced rage, sometimes fear. 
The patient’s reaction was, ‘I didn't 
know what came over me. | felt like 
an animal.’ Stimulation of the hippo- 
campus has produced anxiety... .” 
Differences in procedure and prob- 
lems of localization make comparison 
of the human results difficult. 
Perhaps the generalization 
that can be made in connection with 
negative reinforcement produced by 
brain stimulation is that, ‘“The emo- 
tional disturbance capable of moti- 
vating learning is not produced in- 
discriminately by electrical stimula- 
tion of any area of the brain, but is 
limited to the stimulation of specific 
areas”’ (18). Further experiments are 
needed to clarify the behavioral 
status of the responses produced by 
stimulation of these Such 
clarification will not be achieved if 
these experiments continue to be 
formulated in terms of the subjective 
aspects of the animals’ experience 
during electrical stimulation. The 
fact that they have been so formu- 
lated suggests that inadequate con- 
sideration has been given to the com- 
plex conceptual problems involved in 
research on emotional behavior. For 


one 


best 


areas. 


a detailed discussion of these prob- 
lems, the reader is referred to papers 


by Hebb (40), and Lindsley (59). 


Positive Reinforcement 


The first observations on this phe- 
nomenon were made by Olds (83) in 
the course of studies on the effects of 
electrical stimulation of the reticular 
formation. By accident, electrodes 
implanted in one of the experimental 
animals landed, not in the reticular 
formation, but in the septal region 
of the rhinencephalon. When stimu- 
lated in this area, the rat kept return- 
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ing to the corner of the observation 
box where it had last been stimu- 
lated. By appropriate manipulation 
of this effect, it was found that the 
animal could be directed to any part 
of the box. (This observation has 
since been confirmed in the monkey 
{21].) Animals with electrodes 
planted in this area were then tested 
in the Skinner box and it was found 
that rats who received electrical 
stimulation of this region following 
lever-pressing responses, would re- 
peatedly make such responses. More- 
over, they maintained extraordinarily 
high rates of responding, without ad- 
ditional forms of reinforcement (79). 
These findings were subsequently 
confirmed and extended to other 
species. It has since proved possible 
to develop and maintain stable lever- 
rates in cats, rats, and 
monkeys for periods up to six months 
(7, 95). 

Several investigators have explored 
the relationships between the rein- 
forcing effects of “intracranial self- 
stimulation” and learning motivated 
by primary reinforcers such as food, 
water, or sex. Variations in reinforce- 
ment schedule (continuous, fixed 
ratio, variable interval) have been 
found to result in differential lever- 
pressing curves similar to those ob- 
tained with corresponding variations 
in food or water reinforcement sched- 
ules (95). Rate of responding is sensi- 
tive, within limits, to changes in the 
intensity of electrical stimulation 
(“amount of reinforcement’’), and is 
negatively correlated with the dura-- 
tion of the interval between intra- 
cranial stimulations (7).  Interac- 
tions between the reinforcing prop- 
erties of intracranial self-stimulation 
and those of other reinforcers have 
also been demonstrated. Animals 
under food deprivation show higher 
response rates to intracranial self- 
stimulation than do animals under 


pressing 
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satiation conditions (7, 8). When 
both brain-stimulation and water- 
reinforcement levers are available on 
a continuous reinforcement schedule, 
the decrease in response rate to the 
water lever, as a function of satia- 
tion, is accompanied by a simultane- 
ous decrease in rate of responding to 
brain stimulation. If water reinforce- 
ment is given on a variable-interval 
schedule, while the brain-stimulation 
schedule remains continuous, re- 
sponse rate on both bars is main- 
tained at a relatively high level (7, 
8). 

Interactions between emotional be- 
havior and intracranial self-stimula- 
tion have also been reported. Using a 
Conditioned Emotional Response 
(CER) superimposed upon an estab- 
lished lever-pressing response, Brady 
and his associates have begun to ex- 
plore these interactions (5, 6, 7). 
After a stable lever-pressing response 
has been developed in rats or cats to 
water reinforcement, the CER is pro- 
duced by a series of conditioning 
trials in which an auditory stimulus 
is followed by shock. Within a short 
time, emotional behavior (crouching, 
defecation, immobility) has become 
conditioned to the CS; the extent of 
the conditioning is reflected in de- 
creased response rates following the 
CS. However, stable CER’s, originally 
established to water reinforcement, 
fail to appear when intracranial self- 
stimulation is substituted for water. 
Furthermore, when half-hour inter- 
vals of water reinforcement and intra- 
cranial self-stimulation are alternated 
within a single two-hour session, the 
CS does not elicit the CER during pe- 
riods of brain stimulation, even 
though termination of the CS is ac- 
companied on each trial by shock. It 
has also been found that resistance to 
the original development of the CER 
may be produced by providing the 
animal simultaneous access to a lever 
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delivering brain stimulation, during 
the process of emotional conditioning 
to water reinforcement (7, 9). The 
relation of such findings to the re- 
ported effects of subcortical lesions 
upon the CER (6, 9) is not yet clear. 
When an auditory CS is paired 
with intracranial  self-stimulation, 
without shock, the lever-pressing rate 
begins to increase during the five- 
minute period following the CS, be- 
fore the delivery of brain stimulation. 
With repeated trials, this increase 
becomes consistent and _ reliable, 
though response rates remain at signi- 
ficantly lower levels during the inter- 
vals between presentation of the CS 
(7). These findings suggest that in- 
tracranial self-stimulation may be 
used as a secondary as well as a pri- 
mary reinforcer in such situations. 
Olds (81) reports an experiment 
intended to compare the effective- 
ness of food versus brain stimulation 
in reinforcing maze and runway be- 
havior in rats. Latency differences 
between food and stimulation groups 
were not significant in the runway. 
In the maze, however, differences be- 
tween the number of trials to criter- 
ion were significant and favored the 
food group. A number of subsidiary 
observations are of some interest. 
While the performance of the food 
group improved on the first run of 
each succeeding day, the stimulation 
group showed a decrement. In a 
number of cases, electrode place- 
ments which had produced high re- 
sponse rates in the Skinner box failed 
to adequately reinforce maze and 
runway performance. Because of the 
problems involved in equating the 
groups with respect to the other vari- 
ables in maze learning, the two groups 
are not strictly comparable. 
Experiments of Brady and his as- 
sociates indicate that the effective- 
ness of intracranial self-stimulation 
as reinforcement is related, not only 
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to electrode locus, but to the locus of 
prior brain stimulation as well. The 
nature of these relationships has been 
explored in the monkey, using multi- 
lead electrodes which permit simul- 
taneous access to as many as 20 stim- 
ulation or recording points. It has 
been found that although high, stable 
response rates may be obtained from 
intracranial self-stimulation in the 
amygdala, prior stimulation in the 
medial forebrain bundle or orbital 
frontal region adversely affects these 
response rates. Similarly, prior stim- 
ulation of the hypothalamus or 
amygdala results in significantly 
lower and less stable rates of respond- 
ing to intracranial self-stimulation of 
the medial forebrain bundle or en- 
torhinal area (7). 

It has become increasingly appar- 
ent that electrode locus is among the 
most important of the variables in- 
fluencing the effectiveness of rein- 
forcement produced by brain stimula- 
tion. There is, for example, some evi- 


dence that the loci of electrode place- 
ments effective in producing such 
reinforcement may differ in the rat, 
the cat, and the monkey (7, 82, 95). 
However, exposition of the findings 


on anatomical localization, species 
differences, and behavioral effects is 
complicated by the fact that studies 
of intracranial self-stimulation 
often not comparable with respect to 
methodology, stimulus parameters, 
and behavioral criteria. On the basis 
of the studies reported thus far, only 
one anatomical generalization may be 
made. All the structures involved in 
this phenomenon may be included in 
or related to the “limbic 
system.” Several excellent reviews of 
its anatomy, physiology, and behavi- 
oral implications are available (9, 25, 
51, 67, 69, 85, 89). 

As might be expected, the discov- 
ery of positive reinforcement pro- 
duced by intracranial self-stimulation 


are 


so-called 
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has aroused considerable interest and 
a great deal of speculation. This 
speculation ranges from Delgado’s 
concept of ‘attractive’ and ‘‘unat- 
tractive’ cerebral areas (21), to Olds’ 
tentative conclusion that ‘“‘stimula- 
tion in these areas must excite some 
of the nerve cells that would be ex- 
cited by satisfaction of the basic 
drives—hunger, sex, thirst, and so 
forth’ (83). The analysis of neural 
and behavioral mechanisms mediat- 
ing the reinforcing effects of intra- 
cranial self-stimulation has hardly 
begun. Indeed, it has not yet been 
conclusively demonstrated that we 
are dealing with “‘pleasure centers’ 
(83) or “a system within the brain 
whose function it is to produce a re- 
warding effect upon behavior" (79). 

In the Skinnerian sense of a stimu- 
lus whose presentation increases the 
probability of occurrence of a prior 
response, intracranial self-stimula- 
tion must be termed reinforcing. 
However, the use of intracranial self- 
stimulation as reinforcement in a 
wider variety of learning situations 
would be desirable. To date, these 
reinforcing properties have been dem- 
onstrated only in lever-pressing and 
maze learning. Even in these situa- 
tions, certain possibly significant dif- 
ferences may be noted between food 
and water reinforcement and intra- 
cranial self-stimulation. 

1. Extinction of the lever-pressing 
response is extremely rapid, following 
the termination of intracranial rein- 
forcement and there seems to be an 
absence of spontaneous recovery (79). 

2. Rats could not learn a maze or 
runway if this involved running out 
of a starting alley across an open field 
into a goal box (85). 

3. The stimulation group in the 
maze experiments showed a decre- 
ment on the first trial of each succeed- 
ing day, while the food group showed 
improved performance (81). 
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4. Electrode placements which had 
produced high response rates in the 
Skinner box failed to adequately rein- 
force maze learning (81). 

5. In contrast to their perform- 
ance under food reinforcement, mon- 
keys undergoing intracranial self- 
stimulation did not respond satisfac- 
torily on a schedule in which rein- 
forcement was contingent upon the 
withholding of lever-pressing re- 
sponses. Animals could not regularly 
delay their responses for the twenty- 
second interval required to obtain 
reinforcement (10). 

These differences may have im- 
portant implications for the inter- 
pretation of the positive reinforce- 
ment produced by brain stimulation. 
They suggest that it may be prema- 
ture to conceptualize intracranial 
self-stimulation in terms of “‘ap- 
proach,” “reward,” or “pleasure’’ 
centers in the brain. Such “subcorti- 
cal phrenologizing’’ is no substitute 
for careful experimental analysis, and 
is no more likely to advance our 
knowledge of brain function than 
was “‘cortical phrenology.”’ 

BRAIN STIMULATION AND LEARN- 

ING: MISCELLANEOUS STUDIES 


The following studies are not 
directly concerned with intraneural 
conditioning or reinforcement pro- 
duced by brain stimulation. They 
have in common only the fact that 
they are designed to explore the ef- 
fects of brain stimulation during per- 
formance in a learning situation. 

In 1934 Gengerelli (28) presented 
a neurophysiological theory of learn- 
ing based primarily upon the effects 
of two parameters of electrical stimu- 
lation: frequency and pulse duration. 
Since that time, a considerable body 
of research on the behavioral and 
physiological effects of variations in 
stimulus parameters has been pub- 
lished (54, 74). Moreover, Gengerelli 


has recently developed a technique 
which permits remote-control stimu- 
lation of unanesthetized, freely run- 
ning animals (29, 30). Electrical 
stimulation is transmitted to im- 
planted electrodes by means of mini- 
ature receiving sets mounted directly 
on the animals. Employing this 
technique, Gengerelli has undertaken 
to test his theory in a number of 
learning situations, with rats. These 
experiments (31, 32, 33) are the first 
of a series designed to investigate the 
effects upon learning of variations in 
frequency, pulse duration and cere- 
bral locus; it is hoped that eventually 
the interaction of these variables may 
also be studied. 

As formulated by Gengerelli, the 
theory predicts only that stimulus 
frequency and duration will be im- 
portant variables in determining the 
effects of electrical stimulation of the 
brain upon learning. The experi- 
ments to date have been concerned 
with the differential effectiveness of 
two frequency rates upon perform- 
ance in the maze (31) and discrim- 
ination box (32, 33). The results are 
equivocal, indicating only that stim- 
ulation of the rat brain during per- 
formance may have some effect upon 
learning. The nature and direction 
of this effect is as yet undetermined. 
Further evaluation of the experi- 
ments is complicated by a number of 
factors. The possibility of nocicep- 
tive stimulation was not adequately 
controlled, and may account for most 
of the reported results. Electrode 
placements varied from animal to 
animal, and histological verification 
of electrode locus is lacking. No at- 
tempt was made to equate the effec- 
tive stimulus intensity received by 
each animal, and this facter may be 
important in view of the finding (30) 
that both patterns of response and 
thresholds for their elicitation change 
markedly over time. The ingenuity 
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and potential utility of the Gengerelli 
technique is such that it is to be 
hoped that these factors will be more 
carefully controlled in future experi- 
ments. 

Chiles (12) has reported an experi- 
ment on stimulation of the ‘“‘dien- 
cephalic activating system’’ during 
performance in a lever-pressing task. 
After having reached an asymptote 
on this task, cats were stimulated 
through electrodes implanted in the 
VM and CM nuclei of the thalamus, 
and in the posterior hypothalamus. 
Such stimulation resulted in a de- 
crease in lever-pressing and an in- 
crease in response variability. Chiles 
attributes the results to “distraction” 
produced by the sensory and/or 
motor consequences of the stimula- 
tion. Sensory effects from stimula- 
tion of these areas in humans, have 
been reported (38). 

In an attempt to delineate the 
structures involved in the behavioral 
effects of frontal lobe ablations or 
prefrontal lobotomy, Rosvold and 
Delgado (91) stimulated different 
levels of the anterior portion of the 
monkey brain during performance on 
delayed alternation and visual dis 
crimination tests. Although such 
stimulation produced responses rang- 
ing from hypo- to hyperactivity, 
there was no interference with the 
visual discrimination. Moreover, 
only those animals stimulated in the 
region of the head of the caudate nu- 
cleus showed any deficits in delayed 
alternation. The caudate group, how- 
ever, was significantly different from 
and inferior to the noncaudate ani- 
mals on this problem; their perform- 
ance fell to chance and remained at 
this level for the duration of the elec- 
trical stimulus. Caudate stimulation 
resulted in a decrease in activity 
which the authors describe as ‘“doz- 
ing’ (91). Akert and Andersson (1) 
have observed “‘inertness’’ as a conse- 


quence of caudate stimulation of the 
cat, and such inertness could be pro- 
duced in caudate monkeys by increas- 
ing the intensity or duration of the 
electrical stimulus (92). (In con- 
trast to the hypoactivity produced by 
caudate stimulation, destruction of 
caudate points by electro-coagula- 
tion resulted in persistent hyperac- 
tivity and continued, though tem- 
porary, deficits in delayed alterna- 
tion.) It has been suggested (99), 
that disturbance of postural com- 
ponents of behavior may play a ma- 
jor role in producing delayed-re- 
sponse deficits. If this hypothesis is 
correct, the effect of caudate stimula- 
tion—and_ destruction—upon_ de- 
layed alternation may be due to inter- 
ference with postural and/or orient- 
ing responses which serve as cues to 
the animal. The results of this experi- 
ment are provocative, and suggest 
that some of the deficits customarily 
attributed to ablation of frontal cor- 
tex may be more appropriately re- 
ferred to mechanisms involving both 
cortical and subcortical structures. 


PROBLEMS OF ELECTROPHYSIO- 
LOGICAL METHODOLOGY 


Whenever an electrical stimulus is 
applied to the CNS, interpretation of 
the responses elicited presents a prob- 
lem, not only for the psychologist, 
but for the biophysicist, the anatom- 
ist, and the physiologist. The bio- 
physical aspects of this problem 
stem from the fact that neural tissue 
is extremely sensitive to variations in 
the parameters of electrical stimula- 
tion. Even slight changes in fre- 
quency, intensity, duration, or wave- 
form may produce markedly different 
responses. Furthermore, as Lilly (56) 
has pointed out, while the fundamental 
of a given stimulus frequency may be 
affecting one structure, the harmonics 
of the frequency may concurrently be 
stimulating other areas. The intro- 
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duction of timers, relays, micro- 
switches, etc., into the circuit compli- 
cates the problem of stimulus control 
still further. Indeed, until such fac- 
tors are controlled, and stimulus pa- 
rameters are systematically varied 
for a specific structure in a given spe- 
cies, no conclusions as to function can 
be drawn solely from the results of 
electrical stimulation. 

The anatomical aspects of this 
problem have been clearly stated by 
Mountcastle (77): 

“A pair of electrodes cast into the 
brain may fall into a volume of solid 
grey cells and activate those cells 
alone. In another position, it may 
encompass within its adequate stimu- 
lating field a large number of fibers 
connected with diverse regions of 
grey within the brain.” In short, the 
physiological psychologist’s aim of 
correlating structure with function is 
complicated by the fact that the site 
of the stimulating electrode is not 
necessarily synonymous with the site 
of action of the stimulus current. 

The physiological problems in such 
research are also formidable. Reports 
of variations in impedance, thresh- 
old, and patterns of response to stim- 
ulation of chronic preparations, are 
common. Furthermore, intense or 
long continued electrical stimulation 
has an injurious effect upon neural 
tissue and may eventually produce 
lesions. Fortunately, progress has 
been made in controlling these injuri- 
ous effects. Techniques for this pur- 
pose have recently been developed 
(55) and are currently being used in 
studies on intracranial self-stimula- 
tion. Their extension to other areas 
of stimulation research may be ex- 
pected, if the favorable results re- 
ported to date are confirmed. In this 
connection, the relation between the 
behavioral effects of lesions and of 
electrical stimulation has never been 
clarified. The problem has assumed 
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growing importance because of the 
tendency of many investigators (9, 
80, 85) to interpret the behavioral 
effects of electrical stimulation in the 
light of effects produced by lesions in 
the structures involved. It has often 
been assumed, for example, that le- 
sions and stimulation produce di- 
rectly opposite effects upon behavior, 
but there is evidence (91) that such 
an assumption may represent a con- 
siderable oversimplification of the 
problem. 

One final point is worthy of note. 
As it has been used to date, electrical 
stimulation does not constitute 
‘physiological stimulation.”’ That is, 
the characteristics of the stimulus 
are such that its effects are not neces- 
sarily equivalent to normal, physio- 
logical excitation of nerve fibers. Its 
effects may be due either to excita- 
tion, inhibition, or interference with 
ongoing neural processes. 

Factors such as those summarized 
above represent only a portion of the 
problems involved in the use of elec- 
trical stimulation of the brain. They 
suggest that functional interpreta- 
tions cannot be made solely on the 
basis of stimulation experiments. 
They indicate further that extreme 
methodological sophistication, how- 
ever difficult to achieve, is a major 
prerequisite for competent research 
in this area. The growth of interdis- 
ciplinary research projects attests to 
the increasing recognition of this fact. 


CONCLUSIONS 


The psychologist employing brain 
stimulation as a technique for the 
study of the learning process is faced 
with several tasks. These include the 
experimental analysis of the behavior 
produced by such stimulation, the 
elucidation of physiological mechan- 
isms involved in this behavior, and 
the solution of methodological, con- 
ceptual, and theoretical problems. 





ELECTRICAL STIMULATION OF THE BRAIN 


Progress in this area, then, may be 
discussed in terms of each of these 
tasks. In view of the fact that the 
majority of the studies reviewed in 
this paper are of recent origin, it is 
not surprising that few generaliza- 
tions may as yet be made concerning 
the neurophysiological correlates of 
learning. Certain accomplishments 
and trends may be pointed out, how- 
ever, and their implications for future 
research noted. 

The use of brain stimulation as a 
behavioral research technique has 
opened new areas of research on the 
learning process. There has been a 
considerable accumulation of facts, 
and major advances have been made 
in electrophysiological methodology 
and instrumentation. Carefully con- 
trolled studies of cortical condition- 
ing are being carried out, and the 
interaction of cortical and subcortical 
mechanisms in such conditioning is 
being investigated. Electrical stimu- 
lation of the brain has proved a useful 
technique in the delineation of cere- 
bral structures related to emotion and 
nociception and the physiological cor- 
relates of motivational states are also 
being explored. Negative and posi- 
tive “reinforcement” produced by 
brain stimulation have been demon- 
strated, and the relation of such rein- 
forcement to a wide variety of behav- 
ioral and physiological variables is 
being studied. 

Detailed experimental analysis of 
the effects of brain stimulation upon 
learning and motivation has scarcely 
begun. In addition to its effects upon 
the performance change being meas- 
ured, e.g., response rate, such stimu- 
lation may also produce alterations in 
the general behavior of the animal. 
In the types of learning situations 


used to date, notably lever-pressing, 
such behavior tends to be overlooked 
completely or only casually observed. 
Analysis of such alterations in other 
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aspects of the animals’ behavior 
should prove fruitful in providing us 
with insights into the behavioral 
mechanisms mediating the effects of 
brain stimulation upon learning. 

In view of the avowed purpose of 
most of the studies reviewed in this 
paper, it is surprising that so few of 
them have actually been concerned 
with the elucidation of the physio- 
logical mechanisms involved in learn- 
ing and motivation. Except inferen- 
tially, we have hardly begun to in- 
vestigate the neural and biochemical 
processes which intervene between 
the stimulation of a given structure 
and the performance changes which 
we are measuring. Some of the tech- 
niques which would make such re- 
search feasible are already in exist- 
ence (7, 26, 25). These techniques 
permit the recording of electrical ac- 
tivity in diverse regions of the nerv- 
ous system, and the assessment of 
biochemical changes produced by 
electrical stimulation. Such proced- 
ures, when combined with electrical 
stimulation of the CNS, may make 
it possible to trace the complex chain 
of physiological events involved in 
learning. The effects upon these 
physiological processes of manipulat- 
ing certain behavioral variables may 
be similarly investigated. The re- 
search possibilities suggested by such 
techniques are among the most excit- 
ing yet encountered in physiological 
psychology. 

Perhaps our greatest need, how- 
ever, is for a type of conceptual clar- 
ity which can be achieved only 
through continual re-examination of 
the assumptions upon which our re- 
search is based. As Teuber has 
pointed out: 

“We still find uncritical references 
to one or another mode of parcella- 
tion of cortex or subcortex, as if the 
parcellations themselves (‘primary 
areas’, ‘association cortex’, ‘visceral 
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brain’) yielded functional systems. 
Our greatest hazard, however, lies in 
the use of inadequate terms in the 
description of altered behavior. What 
are ‘amnesia’, ‘tameness’, ‘savage- 
ness’, ‘visual discrimination deficit’, 
except mere symptoms which await 
further experimental analysis’ (99). 
Teuber’s comments were directed to 
the general area of physiological psy- 
chology, but the parcellation and in- 
adequate terminology to which he re- 
fers are particularly prevalent in cur- 
rent research on brain stimulation 
and learning: viz., ‘reward systems,”’ 
“pleasure centers,’’ ‘“‘attractive and 
unattractive cerebral areas,’’ ‘‘dis- 
traction,” etc. Many psychologists 
working in this area have either ig- 
nored conceptual problems com- 
pletely, or treated them in a superfi- 
cial and inadequate manner. 
Lashley (52), Hebb (40), Teuber 
(99), and others have called attention 
to these conceptual inadequacies and 
stressed the need for continual revi- 
sion of the psychologist’s ‘‘conceptual 
nervous system.’’ Sophisticated the- 
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oretical formulations may not yet be 
possible, but adequate conceptualiza- 
tions are a necessity. To cite Teuber 
once again, ‘‘No degree of refinement 
of ablation or stimulation techniques 
can substitute for clarity of concepts 
relating to structure and function” 
(99). 

Whatever the status of past and 
present research in this area, there 
can be little doubt of the potential 
utility of brain stimulation as a tech- 
nique for the study of the learning 
process. This utility will be consider- 
ably increased if brain stimulation is 
combined with electrical recording, 
ablation, and biochemical assessment 
techniques. Themethodological prob- 
lems to be solved are complex, but 
considerable progress has already 
been made and further advances may 
be expected. If experimental ingenu- 
ity and conceptual sophistication 


keep pace with progress in methodol- 
ogy and instrumentation, we may 
look forward to the eventual develop- 


ment of a psychophysiology of learn- 
ing and motivation. 
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LABORATORY STUDIES OF BEHAVIOR WITHOUT 


AWARENESS 


JOE K. ADAMS 
VA Hospital, Palo Alto! 


The term “behavior without aware- 
ness” has been used to cover a con- 
siderable variety of psychological 
processes. This review will attempt 
a classification of these processes in 
terms of the experimental conditions 
under which they have been observed. 
Many alternative classifications could 
be made; the chief virtue of the pres- 
ent one is that it tends to separate 
those results which can be relatively 
easily reproduced from those which 
can be reproduced only with con- 
siderable difficulty or not at all. 

The following kinds of observa- 
tions will not be included: 


1. Studies of posthypnotic sugges- 
tion. 

2. Studies of automatic writing. 

3. Studies of extrasensory percep- 


tion. 
4. Nonexperimental observations. 
5. Studies of processes of which Ss 
are never directly aware. 


Various aspects of behavior with- 
out awareness were reviewed in 1917 
by Coover (9), in 1940 by Collier (8), 
in 1942 by Miller (49), and in 1951 by 
Lazarus and McCleary (40). The 
present review, however, presents a 
coverage somewhat different 
any of these. 

By behavior the reviewer means 
anything that the individual does 
that is publicly observable, whether 
the behavior is defined in terms of 
movement, accomplishment, or phys- 
iological changes.? 


from 


1 This manuscript was prepared while the 


author held the Thomas Welton Stanford 
Fellowship in Psychical Research at Stanford 
University, 1955-56 

2? Some psychologists object to the inclusion 
of physiological changes under the term “be 


By awareness the reviewer means 
conscious awareness, which is taken as 
a primitive (undefined) term. Some 
psychologists consider the indices or 
criteria of awareness, e.g., appropri- 
ate verbalization, as the only mean- 
ing that can legitimately be given to 
the term; others use ‘“‘awareness”’ as 
a construct based upon such indices 
but without the substantive meaning 
of consciousness. The clarity gained 
by avoiding the phenomenological 
meaning is more apparent than real, 
as no psychologist to date has given 
an adequate behavioral definition of 
awareness except in terms of rather 
specific experimental conditions, in 
other words, the very same terms in 
which any psychologist specifies his 
indices or criteria of awareness. 

Lack of awareness by S may be 
with respect to: 

1. The behavior itself; e.g., S may 
be whispering without realizing that 
he is whispering, or he may be saying 
“vertical” on 60% of a sequence of 
trials without being aware that he is 
responding more than 50% of the 
time in this way.’ 

2. The relation of his behavior to 
some contingent event; e.g., S may 
be unaware that his behavior is being 
influenced by F’s saying “right.” 

3. The sensory experiences that 
usually accompany a given kind of 
stimulation; e.g., S may be unaware 
of a very faint light that is being pre- 
sented in the sense that he lacks the 
havior.’ There seems to be no generally ac 
cepted word which has the meaning desired. 

* Research which deals with processes of 
which Ss are mever directly aware, such as 
change in pupillary size and GSR, will not 
be included unless some other kind of lack of 
awareness is involved. 
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usual visual sensations. This is one 
of the traditional meanings of ‘‘lack of 
conscious awareness.”’ 

4. ‘The fact that he is discriminat- 
ing the presence from the absence of 
a given stimulus better than chance 
(this kind of lack of awareness often 
follows from 3, under suitable labora- 
tory conditions). 

5. The fact that he is responding 
differentially to different stimuli in a 
particular systematic way. 

6. Contingencies in the environ- 
ment which produce changes in S’s 
behavior; e.g., Smay be unaware that 
a given word has preceded electric 
shock, though the word produces 
changes in heart rate or GSR. 

There is probably no psychologist 
who doubts the existence of any of 
the foregoing kinds of behavior with- 
out awareness, if he is allowed to ex- 
press them in the terminology he pre- 
fers. For example, among clinical 
psychologists it is widely believed 
that in ordinary social interaction 
people often respond, in ways of 
which they are unaware, to stimuli 
which they are unaware that they 
are responding to, and that such be- 
havior without awareness can be of 
special importance in psychopathol- 
ogy. In therapy great stress is placed 
on the importance of becoming aware 
of the stimuli to which one responds, 
of the response one makes, and of the 
reasons that one responds in the way 
one does. 

Experimental psychologists dem- 
onstrate their belief in the occurrence 
of behavior without awareness by 
their insistence upon certain kinds of 
methodological precautions. If it is 
asserted that a functional relation- 
ship has been demonstrated by a 
given experiment, and if through ex- 
perimental error or faulty design 
some other stimulus variable has been 
correlated in time with the experi- 
mental stimulus variable, the alterna- 
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tive hypothesis that the second stim- 
ulus may have served as a cue is 
usually considered pact plausible, 
even if the Ss protest that they were 
not aware of the presence of the un- 
controlled variable. For example, be- 
havior without awareness is consid- 
ered a very plausible explanation of 
above-chance results of those ESP 
experiments in which correlated vari- 
ables could conceivably have served 
as cues (9, 34, 35, 52). 

The impression is apparently wide- 
spread that behavior without aware- 
ness has been adequately demon- 
strated in the laboratory, although 
certain forms of it, e.g., subception 
(4, 12, 29, 30, 39, 40, 44, 45, 46) and 
conditioning to subliminal stimuli 
(1, 2, 26, 50, 62, 74), are generally 
known to be very elusive, to say the 
least. With regard to concept forma- 
tion, for example, Leeper (41, p. 
731) says, “Some of the Ss,..., 
develop the ability to name new ex- 
amples without being able to say how 
they do it, even when the necessary 
formulations lie well within the limits 
of their vocabularies.... In the 
studies of Heidbreder (1924, 1946, 
1947), Smoke (1932), Reese and 
Israel (1935), Snygg (1935), Heid- 
breder, Bensley, and Ivy (1948), and 
Bouthilet (1948) this unconscious 
process is abundantly demonstrated.” 
After discussing additional experi- 
ments, many of which involve ani- 
mals, Leeper concludes, “.. . these 
experiments emphasize that the con- 
trolling processes involved ought to 
be studied primarily in terms of their 
functional properties and only inci- 
dentally, if at all, in terms of whether 


734). 

As this review will show, however, 
only one form of behavior without 
awareness has been established on an 
unquestionable experimental basis. 
Experiments which reputedly have 
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shown the kinds of phenomena which 
are ordinarily meant by ‘behavior 
without awareness” have one or more 
of the following limitations: 

1. Alternative 
plausible. 

2. Negative results have been ob- 
tained upon replication by other ex- 
perimenters. 

3. No replication 
ported. 

4. The effects are so slight that the 
question of whether certain statisti- 
cal assumptions are precisely satis- 
fied becomes critical (this is true even 
when by virtue of large numbers of 
observations the p value is very close 
to zero). 


explanations are 


has been re- 


The latter part of the quotation 
from Leeper expresses an attitude 
that is quite prevalent, even among 
some experimenters who have worked 
on learning without awareness. It is 
an attitude that is directly contrary 
to the importance attached to aware- 


ness by those concerned with psycho- 
dynamics. 

Distinctions which seem: to be of 
special importance in a discussion of 


behavior without awareness are 
whether stimulus discrimination is in- 
volved, whether the behavior is 
learned during the experimental pro- 
cedure or not, and whether, in case 
stimulus discrimination is involved, 
the S knows what it is he is supposed 
to be discriminating. These distinc- 
tions are utilized in the outline which 
follows. 


I. BEHAVIOR INVOLVING STIMULUS 
DISCRIMINATION 
A. Experiments in Which the Subject 
Knows the Specific Nature of the Cue, 
but Is Not Aware of the Usual Sensory 
Qualities nor of the Fact That He Is 
Succeeding in Discriminating 
1. Behavior Not Learned During the 
Experiment. 
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The only kind of behavior without 
awareness which can be easily repli- 
cated is the kind reported in 1884 by 
Peirce and Jastrow (52). Working 
with lifted weights, they provided an 
example of the now familiar fact that 
if Sis forced to make a discriminatory 
judgment, he may do far better than 
chance even though his judgments are 
made with no confidence whatsoever, 
e.g., he thinks that he is merely guess- 
ing because the usual kind of sensory 
experience is lacking. The procedure 
Peirce and Jastrow used was rather 
unusual. A weight was lowered upon 
the tip of S’s forefinger (Peirce) or 
middle finger (Jastrow). S then said, 
“Change,” whereupon the weight 
was either increased or diminished by 
a small amount. S again said, 
“Change,” whereupon the weight was 
restored to its original intensity. S 
judged the order of the changes and 
gave a confidence judgment of 0, 1, 2, 
or 3, 0 meaning that there was “‘ab- 
sence of any preference for one answer 
over its opposite, so that it seemed 
nonsensical to answer.’’ From the 
published report the percentages cor- 
rect for judgments given with 0 con- 
fidence can be computed only for 
Jastrow, and are reproduced in Table 
Re 

In Table 1 all percentages listed 
under 0 confidence are significantly 
greatly than expectation of 50, and 
some are quite appreciably greater. 
These results are given in some detail 
because, although based upon only 
one subject, and a very sophisticated 
one at that, they are similar to results 
that have been reported by investiga- 
tors using other experimental pro- 
cedures with lifted weights or work- 
ing in other sense modalities. For 
example, Fullerton and Cattell (14), 
following the more usual procedure 
of having S lift one weight and thena 
second and then judge whether the 
second was lighter or heavier than 
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TABLE 1 
PERCENTAGE CORRECT FOR JUDGMENTS GIVEN WITH ZERO CONFIDENCE BY JASTROW, 


IN AN EXPERIMENT WITH Two Sets or LirTep WEIGHTS 


Ratio of 
Pressures 


Confidence 


0 1 2 3 


1.015 63 75 60 100 

(176) (68) (5) (1) 

1.030 75 87 96 100 

(141) (83) (24) (2) 

1.060 91 99 96 100 
(56) 


(94) (76) (24) 


Ratio of 
Pressures 


Confidence 





0 1 2 3 
1.005 59 6 — — 
(497) (3) (0) (0) 

1.010 66 os _ 
(558) (62) (0) (0) 

1.020 75 92 — _ 
(526) (74) (0) (0) 





Note.—-The number of judgments is given in parentheses. Percentages for other degrees of confidence are given 


for « omparison 


the first, found that, of those judg- 
ments given with confidence judg- 
ment ‘‘doubtful’’ (which in this ex- 
periment was synonymous with “‘pure 
guess’’), 65% (based on 285 judg- 
ments) were correct for one S and 
60% (based on 405 judgments) for 
the other. Sidis (60) had S guess 
what letter or numeral was printed 
on a card held so far away that S 
thought he was making pure guesses; 
20 Ss obtained about 67% correct 
judgments of whether a letter or a 
number was being shown (expecta- 
tion = 50%), about 25% correct judg- 
ments of the particular character (ex- 
pectation=2.8%, if one considers 
each of the 36 possibilities as equally 
probable), and about 35% correct 
judgments of the particular character 
when S knew which 10 characters 
were on the cards (expectation 
= 10°). Stroh, Shaw, and Washburn 
(65) replicated this experiment with 
13 Ss and obtained from 16% to 74% 
correct judgments (expectation 
=10%). Even when the characters 
(letters only) were enclosed in rec- 
tangles so as to attenuate the effect of 
gross differences in shape or bulk, as 
between “‘B” and “‘T,” the percent- 
age results for 8 Ss were 8, 18, 21, 24, 
34, 37, 63, 67. These investigators 
also reported significant results (from 
10% to 39% correct, for 10 Ss) when 


Computed from Peirce and Jastrow, 1884 


the name of the letter was whispered 
so faintly that ‘no sound whatever 
could be heard at the distance at 
which the observer sat.’” 

Coover (9) used tachistoscopic ex- 
posures of letters or numerals; S indi- 
cated after each exposure whether 
the letter he wrote down was a per- 
ception, an inference based on partial 
perception, or a pure guess. In one 
experiment, the percentage right for 
pure guesses of 35 Ss varied from 0% 
to 56%, with an over-all result of 
9%. In a second experiment, the re- 
sults for 15 Ss varied from 0% to 14% 
with an over-all result of 8%. The 
expectation which Coover quotes in 
both experiments is 2.8% (one out of 
36).5 In two other experiments S 
was told the 10 possibilities. In the 
first, the results for 15 Ss ran from 


‘ The mean percentages are 37%, 34%, and 
20%; however, the number of trials varied 
considerably from S to S in each of three 
experiments. 

6 The character exposed was always one of 
the following ten: BH KU Z24579. The 
expectation for any group of Ss cannot be de- 
termined without a study of their guessing 
habits; therefore, the figure of 2.8% which 
Coover gives has an indeterminate error. 
Coover was aware of the influence of guessing 
habits upon expectation, and he analyzed his 
results to show that guessing habits could 
have raised the expectation to a maximum of 
about 4.6%, leaving the results statistically 
significant. 
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6% to 54% (over-all 13%). In the 
second experiment, results for 9 Ss 
were from 8% to 27% (over-all 15%). 
The number of trials was sufficient 
to be able to assert that many of the 
individual differences were genuine, 
with about 60% of the Ss showing 
the effect with expectation of 2.8% 
and about 20% with expectation of 
10%.* Coover also obtained signifi- 
cant results with playing cards held 
at a distance. With 7 Ss the over-all 
percentage right for guessed color was 
76% (expectation=50%) and for 
number was 45% (expectation 
=10%). When the names of the 
characters were whispered at 25 
meters distance and S guessed, the 
over-all results for 4 Ss were 7% 
(2.8 <expectation <<10%) and when 
S was informed of the 10 characters, 
12%. Both these results must be 
considered of dubious statistical sig- 
nificance. 

Pillai (55) presented letters at a 
distance (40 Ss) and also by whisper- 
ing (60 Ss) and reported that all Ss 
achieved scores significantly greater 
than expected by chance; the results 
have not been published in detail. 
Baker (2) reported as high as 81% 
discrimination of two lines as either 
vertical and horizontal or diagonal, 
when the 4 Ss reported zero confi- 
fidence. In the same paper, Baker re- 
ported as high as 86% discrimination 
of dot-dash or dash-dot, given with an 
audiometer to 4 other Ss. In both 
these experiments Baker found the 
percentage right decreasing as the 
intensity was decreased below the 
highest point at which all confidence 
judgments were zero.’ 

6 The difference can probably be accounted 
for by the fact that the expectation was 
actually higher than 2.8%, as indicated in 
footnote number 5. 

7 Ina replication of this experiment by Hil- 
gard, Miller, and Ohlson (26), no such dis- 
crepancies were obtained. See the comment 


in Section I, A, 2. 


387 


Williams (76) projected a circle, 
triangle, or square very faintly on a 
screen; S guessed which of the three 
figures was being presented; results 
for 11 Ss ranged from 15% to 70% 
right, with over-all 50% (expecta- 
tion = 33.3%) even when they ‘‘saw 
nothing at all.’’"* The 4 highest scor- 
ing Ss were rerun; the results were 
45%, 45%, 50%, and 56% (over-all 
49%). A control series without pro- 
jection of figures gave 29%, 31%, 
31%, and 33%. 

Miller (47) used a similar proced- 
ure, with 5 naive and 5 sophisticated 
Ss. In this section we shall consider 
only the sophisticated Ss, i.e., those 
who knew that one of five possible 
geometric figures was being projected 
but thought nonetheless that their 
guesses were unaided by perception. 
There were 125 trials at each of 4 de- 
grees of illumination, 60 v, 70 v, 80 v, 
and 90 v, all below the conscious 
threshold. The over-all percentages 
were 21, 23, 27, and 35 respectively 
(expectation=20%). The highest 
percentage right for any S was 56, at 
90 v. Vinacke replicated this experi- 
ment with Ss who were dark adapted, 
but obtained negative results (73). 

Since 1939 there seems to have 
been little interest in this kind of dis- 
crepancy between performance and 
expectation, perhaps because of a de- 
clining interest among experimental- 
ists in whether a process is conscious 
or not; indeed, the term ‘‘threshold”’ 


* The same random series of 54 figures was 
exposed more than once to each 5S, only the 
pure guesses being analyzed; therefore, be- 
cause of sequential learning, expectation may 
be slightly greater than 33%. This objection 
can also be raised to the second part of Wil- 
liams’ experiment and is not overruled by his 
control series in which no figures at all were 
projected (and the results were in accordance 
with expectation), for S then had no reference 
points for any sequential learning that may 
have occurred. It seems very doubtful, how- 
ever, that sequential learning can account for 
the large results that Williams obtained. 
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is often defined entirely in terms of 
discrimination, so that the expres- 
sion “subliminal discrimination,” 
which originally meant discrimina- 
tion when S was not consciously 
aware of the usual sensory qualities 
(and therefore, unless very sophisti- 
cated about this phenomenon, was 
unaware that he was able to discrim- 
inate), now seems self-contradictory, 
although of course it can be ‘‘trans- 
lated’’ into a statement about co-oc- 
currence of two kinds of behaviors, 
e.g., saying “I’m purely guessing” 
and at the same time discriminating 
above expectation. 

Behavior without awareness of the 
kind described in this section can 
easily be obtained even as a class ex- 
ercise. Nevertheless, caution must be 
exercised to avoid alternative expla- 
nations in terms of sequential learning 
(which, to be sure, might involve an- 
other type of behavior without aware- 
ness) or by inadequate randomiza- 
tion of the sequences of stimuli. Also, 
the classification ‘“‘pure guess’” should 
not be used too lightly. It is impor- 
tant not only to instruct S appropri- 
ately but to try to understand what 
he means when he uses this, or some 
similar, term. Urban (70) comments 
on this problem. 

There seem to have been no pub- 
lished studies of learning to judge 
one’s uncertainties more accurately 
(i.e., to eliminate the discrepancy be- 
tween confidence judgment and per- 
formance) or to manipulate the dis- 
crepancies through practice, knowl- 
edge of results, reward, or punish- 
ment. There has also been little or no 
attempt to determine the average 
magnitudes of discrepancies between 
expectation and performance for vari- 
ous experimental conditions, or to re- 
late these discrepancies to other vari- 
ables, or to explore the relation of in- 
dividual differences (which tend to be 
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quite large) in these discrepancies to 
other variables.® 

Figure 1 indicates some of the mean 
results obtained under the very di- 
verse conditions used by the investi- 
gators whose work has been cited. 


2. Response Based on Learning Dur- 
ing the Experiment. 

There have been relatively few lab- 
oratory studies of learning to respond 
to subliminal stimuli when S is aware 
of the specific nature of the cue. New- 
hall and Sears (50) paired very faint 
visual stimuli with a shock to the 
right hand. S was instructed to main- 
tain fixation, to watch carefully for a 


® Pauline Austin Adams and the author are 
at present engaged in studies of several of 
these problems. 
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visual stimulus, to report perception 
of the stimulus in all cases of higher 
certitude than a guess, and to “‘let 
the right hand take care of itself.’’ 
On the test trials (without shock), 
there were 8 instances of conditioned 
finger retraction without S’s report- 
ing having seen a light on the same 
trial. The authors believe that the 
most plausible interpretation was 
that the motor response was on these 
occasions more sensitive than the 
conscious, but they point out alterna- 
tive explanations, e.g., that S’s at- 
tention had slipped from the visual 
field to the hand-shock situation and 
that a faint light might have passed 
unperceived because of lowered in- 
tensity due to lowered attention. The 
latter interpretation would involve 
behavior without awareness but 
would not meet the authors’ criteria 
for subliminal conditioning. The 


authors are cautious in drawing any 
conclusion; at best the phenomenon 
was present only in a very weak and 


unstable form. Baker (2), after some 
previous unsuccessful attempts (62), 
reported very large and stable effects 
of pupillary conditioning to sub- 
liminal auditory stimuli, but Wedell, 
Taylor, and Skolnick (74) and Hil- 
gard, Miller, and Ohlson (26) made 
very thorough attempts to reproduce 
Baker’s results without success. 
Lazarus and McCleary (40) pre- 
sented 10 nonsense syllables to 9 Ss, 
pairing shock with 5 of the syllables 
All 10 syllables were then presented 
tachistoscopically at near threshold 
duration; the critical syllables gave 
significantly more GSR, even when S 
failed to identify them verbally, than 
did the neutral syllables. Quite aside 
from the question of adequate repli- 
cation of this experiment and from 
the controversy that has arisen over 
the theoretical interpretation of the 
results (29, 30, 39, 46), Eriksen (12) 
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has pointed out that it is possible 
that if the verbal response by S had 
not been restricted to one of the 10 
syllables, e.g., if S had been allowed 
to say ‘I see a ‘v’ and then a blur and 
then a ‘k’...,”’ the “subception ef- 
fect’ as it had been called might have 
been greatly reduced, perhaps to zero. 


B. Experiments in Which the Subject 
Believes Thata Cue Is Being Given and 
Knows the General Nature of the Cue, 
e.g., That It Is Visual, but Does Not 
Know Its Specific Nature 


Experiments in concept formation 
are sometimes cited, as by Leeper in 
the previous quotation, as showing 
discrimination without awareness. 
In these experiments the stimulus is 
clearly supraliminal and S knows that 
some cue is being given and knows 
(at least for some of the concepts) 
what the general nature of the cue is. 
Evidence for discrimination without 
awareness consists of a greater num- 
ber of correct identifications of in- 
stances of a concept than can be ac- 
counted for by the S’s verbaljzation 
(or drawing), assuming that the nec- 
essary formulation would lie within 
the limits of the S’s readily available 
vocabulary (or ability to draw). Ac- 
tually, none of the published reports 
ol concept-formation experiments of- 
fer proof of discrimination without 
awareness in this sense, although it is 
asserted by some of the authors to 
have occurred. In Hull's classic study 
(31), in which Chinese characters 
containing the radical were 
paired with the same nonsense sylla- 
ble, there is no evidence presented 
that the presumably inadequate draw- 
ings of radicals by the Ss were not a 
sufficient basis on which to differenti- 
ate the different groups of characters. 
Similarly, in Smoke’s study (63), it 
is not shown that the verbalizations, 
however much they differed from 


same 
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those with which the experimenter 
started, would not have served to dif- 
ferentiate the groups of geometric 
drawings. In Heidbreder’s 1924 
study (20), cited by Leeper, no as- 
sertion is made that the Ss were un- 
aware of the criteria for making the 
figures (which for purposes of this 
discussion can be considered func- 
tionally equivalent to identifying 
members of a class); Heidbreder as- 
serts merely that the Ss were unaware 
of certain kinds of behaviors (‘‘spec- 
tator’’ and “participant” behaviors) 
in which they engaged while arriving 
at the solutions. In Heidbreder’s 
later papers (21, 22, 23, 24) it is re- 
ported that correct identifications 
preceded correct formulations in 
some cases, but, as with Smoke’s re- 
port, the evidence that the formula- 
tions were an inadequate basis for 
classification is not presented. 

The Rees and Israel study (58), 
also cited by Leeper, deals with a 
kind of phenomenon that is at least 
superficially different from the kind 
of discrimination without awareness 
reported obtained in the concept- 
formation studies, and will be dis- 
cussed in Section I, D. Of the other 
papers cited by Leeper, those by 
Snygg (64) and by Bouthilet, the 
second is unpublished and the first is 
unrelated to discrimination without 
awareness, as far as this reviewer can 
determine. 

Goodnow and Postman (15) had 
Ss decide which of two variations (ad- 
ditive and subtractive) matched a 
given geometric design. In the in- 
structions the additive and subtrac- 
tive variations were explained to S, 
and he was “‘led to believe that the 
correct choice depended on the na- 
ture of the designs and that there was 
a general principle governing correct 
choices” (15, p. 17). E said “Right” 
or ‘‘Wrong”’ after each choice. After 
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every other block of 10, S was asked, 
“How did you decide which card to 
choose?’ Six groups were run with 
varying proportions (.5, .6, .7, .8, .9, 
1.0) of additive designs designated 
“Right” by E. The groups tended to 
match, in their choices, the propor- 
tions of additive designs designated 
“Right” by E£, yet, “Their verbal re- 
sponses gave no indication that they 
perceived the problem as one of prob- 
ability discrimination” (15, p. 21). 
However, the Ss were apparently not 
asked whether they did in fact choose 
additive designs more often. In view 
of the instructions, it is not surprising 
that ‘only three of the 40 Ss in the 
partial reinforcement groups aban- 
doned the search for a lawful solution 

. and treated the task as one of 
discriminating probabilities” (15, p. 
19). As there are many hypotheses 
(e.g., sequential hypotheses) which 
produce a given proportion of addi- 
tive responses (e.g., the hypothesis 
SAASA gives 60% additive choices) 
there is no evidence presented in the 
report that there was any discrepancy 
between what the Ss did and what 
they said they were doing. That Ss 
sometimes match probabilities by 
hypotheses about sequences is shown 
in a paper by Hake and Hyman (18). 


C. Experiments in Which the Subject 

Believes That a Cue Is Being Given, 

but Does Not Know Even Its General 
Nature 


Postman and Jarrett (56), using a 
procedure similar to that of Thorn- 
dike and Rock (see next section), had 
Ss respond to each word given by E&, 
saying ‘‘Right’’ when S’s word was 
connected sequentially with /’s word 
and ‘‘Wrong”’ when S’s word was un- 
related, connected denotatively, etc. 
One group was told the principle at 
the end of the first block of 20 words; 
the uninformed group was required 
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to attempt a statement of the princi- 
ple at the end of each block. The 
authors reported that those unin- 
formed Ss who were able at some 
point during the experiment to ver- 
balize the principle correctly showed 
a small but significant amount of 
learning prior to verbalization; the 
improvement was from about 2.3 to 
about 3.5 words right per block 
(about 11% to 18%). The authors do 
not deal with the problem of aware- 
ness of correlated hypotheses; they 
admit however, “the results... 
show only little reliable evidence for 
learning without awareness” (56, pp. 
254-255). 

Philbrick and Postman (54) pre- 
sented a sequence of 216 words to 
each of 48 Ss, the S being told that a 
number from 1 to 9 had been paired 
with each word and that he was to 
try to guess what the number was on 
each trial, FE saying “Right” or 
“Wrong.”’ Most Ss apparently as- 


sumed that some cue was being given, 


especially as each S was asked to 
state the principle on which he was 
basing his responses after each block 
in which he had 4 or more “right” 
(expectation =1). The authors report 
that the 20 Ss who verbalized the 
principle (the number of letters in 
the word minus one) averaged 42% 
correct in the block of 9 words pre- 
ceding the block after which correct 
verbalization occurred, compared 
with an expectation of 11%. The 28 
Ss who did not verbalize correctly 
had a slight but significant rise in 
their average performance curve. 
However, Philbrick and Postman 
point out that there were hypotheses 
which were partly related to word 
length, and in their report they do not 
eliminate the possibility that the 
above-chance performances can be 
accounted for by the Ss’ awareness of 
these related hypotheses. 
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D. Experiments in Which the Subject 
Does Not Believe That a Cue Is Being 
Given 


1. Behavior Not Learned During the 

Experiment 

In 1900 Dunlap (11) reported that 
the Muller-Lyer illusion had been ob- 
tained with 4 Ss by making the 
obliques merely subliminal shadows, 
so that S thought he was simply bi- 
secting a horizontal line segment. 
Titchner and Pyle (69) replicated 
this experiment with negative results. 
Manro and Washburn (43) also re- 
ported negative results, though their 
paper includes some positive evi- 
dence. Both Hollingworth (28) and 
Bressler (3) reported positive results; 
since 1931 the controversy seems to 
have been dropped. 

In 1939 Miller (47) reported an ex- 
periment in which S guessed (by 
“telepathy’’) geometric figures while 
staring into a mirror (‘as into a 
crystal ball’’) on the back of which 
the figures were being projected at 
very low intensities without S’s 
knowledge (these were the ‘‘naive’’ 
Ss in the experiment referred to in 
Section I, A). At the highest inten- 
sity the 5 Ss averaged 45% correct, 
compared with expectation of only 
20%; one S achieved 91% correct. 
These Ss all showed surprise when 
told that the figures were actually 
being projected. No replication of 
this part of Miller's experiment has 
been reported. 

Perky’s classic study (53) in which 
Ss were asked to draw various ob- 
jects which they were to imagine ona 
ground-glass screen, but which were 
actually being projected, gave clear- 
cut evidence of discrimination with- 
out awareness, but has apparently 
never been replicated, except insofar 
as the studies by Miller can be con- 
sidered a replication. The stimuli 
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used in Perky’s experiment, however, 
were clearly supraliminal for Ss who 
knew 

jee ted; 


that images were being pro- 

various precautions were 
taken to prevent naive Ss from be- 
coming aware, such as making the 
edges of the images uneven, moving 
the image to and fro slowly, and dis- 
tracting the subject and removing the 


image as soon as the S began his de- 
scription. 

Sidis (60) gave each of 20 Ss the 
task of reproducing a complicated 


drawing, then presented a piece of 
cardboard from which S was to choose 
one of the digits 26471538 “to break 
up attention.’ On the margin was 
written one of the digits. The Ss did 
not notice that the digits were on the 
margin, but tended to choose them 
39.4% of the 

12.5%). 

Coover (9) exposed tachistoscopi- 
cally a card with a capital in the lower 
right corner, a digit in the upper left 
corner. SS was told only to look for a 
capital letter in the lower right corner, 
and then to put down any digit that 
should ‘‘come to mind.’’ With 26 Ss 
the coinciding 
with the upper left digit varied from 
9.5% to 36.7%, with a total of 15.4% 
(expectation = 10%). (oover re- 
ported that 15 of the Ss showed the 
effect. However, in view of Coover’s 
statement that the S usually did not 
know a digit was being presented 
with the letter, results are 
rather inconclusive as to discrimina- 
tion without knowing that a cue is be- 
ing given. 

Collier (8) repeated Coover’s ex- 


time (expectation 


percentage  ¢ hoices 


these 


periment, with some modifications. 
A figure similar to a Landolt Circle 
was exposed tachistoscopically at 
the fixation point and one of six geo- 
metric forms was exposed simultane- 
ously in the periphery. S had to 


guess the position of the break in the 
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circle and to name one of the six geo- 
metric forms—‘‘the first one that 
comes to mind.”’ Even though they 
did not know a form was being pre- 
sented, the 10 Ss gave the peripheral 
figure on 22.2% of the trials, com- 
pared with expectation of 16.7%, 
This was the largest of Collier's re- 
sults, with peripheral figures of 10 
minutes of arc; figures of 15 minutes 
and 30 minutes gave smaller results.!° 


2. Response Learned or Strengthened 
During the Experiment 


In a second study by Miller (48), 
using the same apparatus and general 
procedure as the one previously de- 
scribed, rewards and punishments 
were varied among four groups and 
differences in improvement were ob- 
tained. The group showing the great- 
est improvement gave 63% correct 
compared with an expectation of 50% 
(only two figures were used with this 
group); all 4 Ss in the group are re- 
ported to have learned. 

In his experiments on extrasensory 
perception and related phenomena 
Kennedy (34) found that one S 
among 100 college students tested 
with the ‘open matching method” 
made 275 hits in 1,000 matches (ex- 
pectation = 200; p<.00001). S was 
allowed to look closely at the backs 
of the cards and to tilt them to get 
reflections; he reported throughout 
that he was not aware of using visual 
however, when questioned 
about his close scrutiny and tilting 
of the cards he replied that this was 
necessary or ‘there would be nothing 
to goon” (34, p. 150). Kennedy con- 


cues; 


0 Collier 
obtained by 


relates this finding to a result 

Coover that discrimination of 
digits 8 degrees in the periphery was better 
than at 6 degrees or 4 degrees. The percent- 
ages right in Coover'’s experiment were 15%, 
12%, and 13% respectively. At 2 degrees the 
percentage went up to 22%, but the Ss be- 
came aware. 
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cludes, ‘The important point is that 
they (the cues) may be used sub- 
liminally by naive Ss who glance at 
the backs of the cards’’ (34, p. 150). 
Kew psychologists would disagree 
with this conclusion, but the fact re- 
mains that none of Kennedy's 100 
Ss gave clear-cut this 
phenomenon; the phenomenon used 
in the explanation is almost as diffi- 
cult to obtain in the laboratory as 
ESP itself. 

In 1932 Thorndike reported in the 
Fundamentals of Learning (67) a 
number of experiments" dealing with 
learning without under 
the chapter title, ‘‘After-effects with- 
out revival.’’ Thorndike’s major pur- 
pose in these experiments was to 
eliminate the possibility of rehearsal 
(or “inner repetition’) and thus to 
have the opportunity of showing that 
the aftereffects of satisfyingness can 
strengthen a connection even without 
the advantage of frequency. In his 
discussion of the results, however, he 
emphasized also the direct ‘‘confirm- 
ing’”’ influence of the reward as op- 
posed to an effect mediated by ideas 


instances of 


awareness, 


(awareness). This position of Thorn- 
dike is well known; it should be dis- 
tinguished from the assertion that a 
given kind of behavior can be main- 
tained at a high consistency without 
the individual’s becoming aware. He 
would presumably have made the 
latter assertion with respect to many 
kinds of behavior (e.g., playing ten- 
nis, recognizing faces), but to this re- 
viewer's knowledge he did not discuss 
the range of behaviors with respect to 
which it would apply. He was aware 
that the effects of learning without 
awareness which he obtained in the 
laboratory were, in general, slight. 

Thorndike reported several experi- 

1! Some of these did not involve stimulu 


discrimination and will be discussed in Section 


iI. 
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ments involving stimulus discrim- 
ination. In one of these, S completed 
mutilated words, each of which could 
be completed in more than one way. 
During training E said ‘‘Right’’ when 
a dot following an a was filled by a v, 
a dot following a b was filled by an J, 
etc., and ‘‘Wrong,”’ otherwise. Thorn- 
dike reported that 3 of 8 Ss learned 
to respond without awareness of the 
rules; the data which he presents sup- 
port the conclusion for the training 
series (which can be accounted for by 
memory of specific words, as the same 
words were presented at least 14 
times) but not for the initial vs. final 
test trials."* Another procedure used 
by Thorndike was to present S with 
a list of rare words (in one experiment, 
Spanish words) and instruct him to 
underline a synonym for each, from 
five possible choices. / announced 
“Right” or ‘Wrong’ after each 
choice. The words were so arranged 
that in each 100 trials the correct 
word occurred most frequently in the 
first place to the right, less frequently 
in the second place,..., least fre- 
quently in the first place to the left. 
The Ss were questioned at the end, 
and only those unaware that position 
had any relation to correctness were 
included in the analysis of results. 
The result upon which Thorndike re- 
lied most was a test of position 
choices before vs. after training. 
Choices of positions 1 and 2 (from the 
left) declined from 43% to 37%; 
choices of positions 4 and 5 increased 
from 38% to 43%. In an experiment 
with Spanish words, comparable fig- 
ures are a decline from 52% to 40% 


2 Thorndike states that the initial vs. final 
test yielded a significant difference, but the 
data which he presents do not support this 
conclusion. Ironically, he obtained a signifi- 
cant initial vs. final difference for a control 
group (experiment 57) with forced repetitions 
and no announcement of right or wrong, but 


nothing is said about the Ss’ awareness 
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and an increase from 27% to 41%. 
An alternative explanation for these 
results is that the accuracy of the S’s 
discrimination of the rare (or Span- 
ish) words improved with training. 
This explanation is not only plausi- 
ble; it is supported to some extent by 
the data themselves.'* In other ex- 
periments S was asked to judge, on 
each of several hundred trials, which 
of four lines was longest. Two hun- 
dred cards were used, of which 20 had 
numerals in the middle of the card, 20 
had numerals at the right hand end, 
20 had one of the lines drawn two 
millimeters wider than the other, etc. 
The actual differences in length were 
“imperceptible,”’ but / put the iden- 
tifiable features of the cards into 
100% relation with the position of 
the line, announcing which line was 
“right” after each trial. S began by 
trying to see the difference, but us- 
ually gave up careful scrutiny after 
the first few trials and began to guess 
quickly. In some experiments, how- 
ever, the differences in length were 
made greater, so that S 
could actually get about 44% right 
without regard identifiable fea- 
tures of the cards. Thorndike reports 
increases in the percentage of Right” 
responses, without awareness of the 
use of the cues; however, the results 
are reported rather sketchily and are 
difficult to evaluate, as most of these 
experiments do not meet 


somewhat 


to 


present 


1% This explanation helps to account for the 
different results for Ss Alll and BI and the 
failure of Bl and BII to show the tendency to 
underline toward the right in experiment 50. 
The data on page 214 show considerable dis- 
crimination during training, and Thorndike 
indicates in a footnote on page 225, in con- 
nection with another procedure, that the Ss 
did have some knowledge of the meanings of 
rare English words. ‘The results of the ‘‘sub 
tle’’ analysis of experiment 49 (pp. 215-216) 
can be accounted for by the tendency to drop 
out wrong 
crimination. 


responses or by increased dis- 
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methodological standards, particu- 


larly with respect to controversial 
There are three main objec- 


topics. 
tions: 

1. The possibility is not always 
eliminated that an improvement in 
discrimination of‘‘imperceptible’”’ dif- 
ferences has not occurred (which 
would at best place some of these ex- 
periments into Section I, A, 2). 

2. Evidence of lack of awareness is 
often not sufficiently complete. 

3. The possibility is not always 
eliminated that Ss simply learned to 
respond to particular cards. 

Experiment 61 (pp. 239-240), with 
cards and procedure as described 
above, seems to be most nearly free 
of any valid criticism.’ After train- 
ing, Ss averaged 33.1% correct 
choices compared with 25.1% before 
training (expectation = 25%). 

Thorndike and Rock (68) told S 
to respond to a word given by & with 
the first word that he thought of, and 
that / would say “Right” when the 
response word was any of 10 or more 
arbitrarily selected words, and that 
he would receive a money bonus 
based-both on speed and on the num- 
ber ‘‘Right.”’ Actually E said‘‘Right”’ 
if the response word was “‘clearly due 
to sequential connections used in 
speaking or writing’’ and ‘‘Wrong”’ if 
the response was “‘clearly due to con- 
nections used in getting the word’s 
meaning.’ As the Ss were not ques- 
tioned at the end of the experiment, 
it is impossible to say how many of 
them assumed that some cue was be- 
ing given. Thorndike and Rock re- 
ported that, of the 23 Ss who showed 
some learning (of a total of 30), only 


4 The only question is whether ‘Right’ 
was called as often for line 1 as for line 2, etc.; 
if not, the performance of the Ss may have 
been a case of “probability discrimination” 
similar to that studied in Thorndike’s experi- 
ments discussed in Section II, B. 
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one S may have become aware. The 
criteria used for awareness, however, 
were suddenness of discrimination 
and 100% responding. Irwin, Kauf- 
man, Prior, and Weaver (32) showed 
that these criteria are not adequate, 
as their Ss continued to improve after 
being taught the distinction and some 
did not attain a high level of dis- 
crimination. 

The replication of this experiment 
by Postman and Jarrett has been dis- 
cussed in Section I, C, as it is clear 
in the Postman and Jarrett procedure 
that S knew that some cue was being 
given. 

Cohen, Kalish, Thurston,and Cohen 
(6) used as Ss 40 male ambulatory 
patients, all under 55, from a general 
medical population, excluding neur- 
ology and psychiatry. Each S was 
shown 80 cards, each containing one 
verb and six pronouns, and was told 
to make up a sentence beginning with 
one of the pronouns and including the 
verb. FE said “Good” after each 
sentence which S began with ‘“‘I”’ or 
“We,” beginning with the second 
block of 20 cards. The authors report 
that questioning of Ss at the end of 
the series revealed no awareness of 
the contingency between their re- 
sponses and those of EF, although 
their use of “‘I’’ and ‘‘We’”’ increased 
from 44% on the first 20 trials to 64% 
on the last 20, compared with a con- 
trol group’s change from 43% to 
41%. In a second experiment the in- 
crease was from 41% to 55%." 


‘6 The authors state that each .‘S was asked 
which pronoun he had used most, and then 
asked whether E’s behavior influenced his 
choice of a pronoun. It is reported that the 
Ss of experiment I were not aware of the con- 
tingency between their responses and those of 
E, but no information is given about the Ss of 
experiment II in this respect, nor is it reported 
whether the Ss in either experiment were 
aware that they used “I” and “We” 
often than the other pronouns 


more 
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Klein (36), using essentially the 
same procedure as Cohen et al., ob- 
tained both conditioning and extinc- 
tion without awareness. His Ss were 
80 neuropsychiatric patients in one 
experiment and 30 in another. Taffel 
(66) used this procedure with 90 psy- 
chotic and neurotic male patients and 
obtained differences in conditioning 
which were directly related to scores 
on the Taylor manifest anxiety scale. 
The “high’’ anxiety group increased 
their ‘‘I’’ and ‘‘We”’ responses from 
41% on the first 20 trials to 76% on 
the fourth 20 trials, the ‘*medium’”’ 
group from 42% to 69% and the 
“low” group from 39% to 46% (not 
significant). 

In none of the last three papers is 
there much information given about 
what the Ss thought they were do- 
ing, nor is the problem of awareness of 
correlated hypotheses discussed. 

Positive results of conditioning to 
stimuli presented to hysterically an- 
esthetic areas and not consciously 
perceived by S have been reported by 
Sears and Cohen (59) and by Cohen, 
Hilgard, and Wendt (7). Sears and 
Cohen established a CR to a quick 
brush of a whisp of cotton across the 
back of the hysterically anesthetic 
left hand of a patient, though the pa- 
tient denied feeling anything. Shock 
was used as the Syne and the CR was 
obtained by first establishing a CR 
to a sharp rap with a pencil, which 
the patient could feel. The CR to the 
cotton extinguished in one trial, then, 
after reconditioning, in two trials. 
After questioning of the patient and 
denial on her part that she had felt 
the cotton, further trials evoked both 
the CR and conscious awareness of 
the stimulus. The anesthesia disap- 
peared completely at this time and 
had not returned six months later. 
Sears and Cohen had been unable to 
obtain a CR by simply pairing the 
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cotton with shock, or by first estab- 
lishing a CR of the right (nonanes- 
thetic) hand. 

In the study by Cohen, Hilgard, 
and Wendt it was shown first that 
eyelid reflexes to sound could be very 
consistently reinforced or inhibited 
by presenting a light to the blind area 
either .045 sec. (for reinforcement) or 
.225 sec. (for inhibition) before the 
sound was presented. Not once did 
the S report the light. A conditioned 
response to light in the blind area 
was then established by using a puff 
of air as the Syne. Next a conditioned 


verbal response to light was estab- | 


lished'*® by pairing light with a sound 
to which the patient had been trained 
to shout “Light.”” This last CR was 
slow to develop and very weak, per; 
haps understandably, as the patient 
saw nothing to shout about. 

In his studies of salivary condition- 
ing Razran (57) has shown that both 
conditioning and generalization can 
occur with no awareness that more 
salivation is elicited by the S, and by 
closely related (phonetographically or 
semantically) stimuli than by other 
stimuli. Making the Ss aware of what 
was happening or having them adopt 
conscious facilitory or inhibitory at- 
titudes seemed to have little effect. 

Diven (10) instructed his Ss to re- 
spond to each stimulus word with the 
first word that came to mind, contin- 
uing to call words until told to stop. 
The Ss were also told, ‘Don’t try to 
make sense of the task,”’ (10, p. 295). 
Forty stimulus words were given, 
with 12 sec. between each stimulus 
word and “‘stop.”’ A 1-sec. shock was 
administered after the word ‘‘barn,”’ 
which was presented six times during 
the list. After five minutes S was 


16 A previous study by Hilgard and Wendt 
(27) had failed to obtain this effect in a case 
of hemianopsia following surgical resection of 
the left occipital lobe. 
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asked to recall the list, then the list 
was presented without shock, then S 
was again asked to recall. Several 
groups were run, with varying pe- 
riods between the first and second 
sessions. Diven reported that 21 
of 52 Ss remained unaware of the 
fact that ‘‘barn”’ was the signal for 
shock, although their physiological 
indices of anxiety in response to this 
stimulus had increased. He also re- 
ported that the increment in the in- 
dex due to an increased time interval 
between sessions was greater for the 
unaware Ss than for the aware Ss, 
and that the unaware Ss extinguished 
less readily. 

Haggard (17), utilizing Diven’s 
technique, instructed his Ss to re- 
spond to each stimulus word with re- 
lated words. After 10-12 sec., E 
either gave a strong shock or said 
“stop.’’ Forty-two stimulus words 
were given, of which 5 were the word 
‘““sword,”’ 5 were “‘sharp,”’ 10 were ad- 
ditional war words, 10 were peace 
words, 5 were buffer words following 
“‘sword,”’ and 7 were neutral words. 
S was shocked only after “‘sword.” 
The word “sharp” always preceded 
“sword.”’ At the end of the condi- 
tioning session Ss were asked whether 
they could predict the shock; 7 (of 
18) were unaware when it came, but 
were certain it had not come after 
any given word.'’ There was a signifi- 
cant increase in GSR to the precriti- 
cal word “sharp’’ by the unaware 
group; this increase was 2.6 times 
that shown by the aware group. 

Lacey and Smith (37), pointing 
out certain flaws in Diven’s and Hag- 
gard’s experiments, told Ss to chain 


17 Haggard (17) presumably means stimulus 
word but does not say so explicitly. Two Ss, 
who were uncertain whether shock came after 
“sword” or some other war word, were placed 
in the unaware group “for convenience in 
treating the data” (17, p. 261). 
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associate to each stimulus word until 
E said “stop” (after 15 sec.) and to 
tap meanwhile on a telegraph key at 
an even rate. In Group I a strong 5- 
sec. electric shock was given within 
.5 sec. after ‘‘stop’’ when S had been 
chain associating to ‘‘cow.”’ In Group 
II the shock was given after S had 
been chain associating to “paper.” 
“Cow” and “‘paper’’ were each pre- 
sented 6 times; of the remaining 28 
words, 14 were rural and 14 were non- 
rural words (Diven had reported gen- 
eralization to rural words). The 
measure of conditioning was increase 
in heart rate during the 15 sec. of as- 
sociating over the previous 15 sec. 
(maximum minus maximum). Of 31 
Ss, 22 were unaware of the fact that 
the shock followed chain association 
to a certain word; of these, 10 were in 
Group I and 12 were in Group II. 
Seven of the 10 unaware Ss in Group 
I and 8 of the 12 unaware Ss in group 
II showed a greater heart rate during 
association to the critical word than 
association to noncritical words. 
Group I unaware Ss also showed gen- 
eralization to other rural words in the 
list. In a further analysis Lacey, 
Smith, and Green (38) showed that 
the aware Ss immediately developed 
a strong emergency reaction that did 
not increase with the number of rein- 
forcements but instead showed a 
gradual adaptation. Unaware Ss 
showed typical conditioning curves, 
at a much lower level of autonomic 
activity and discrimination than the 
aware Ss. It was also shown that the 
unaware Ss had generalized more 
than the aware Ss. These very in- 
teresting and important results have 
not, to the reviewer's knowledge, 
been replicated. 

Eriksen and Kuethe (13) instructed 
their 31 Ss to give an association to 
each stimulus word as quickly as pos- 
sible, and that they would be shocked 
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if their association was too slow for 
that particular stage of practice and 
also under a second condition which 
they might discover during the ses- 
sion and thus avoid the shock. Five 
of the 15 stimulus words were ran- 
domly designated as critical; on the 
first trial the Ss’ associations to these 
five words were followed by a painful 
electric shock to the ankle. On suc- 
ceeding trials S was shocked when- 
ever he repeated one of the originally 
shocked associations, until a total of 
10 trials or two successive shockless 
trials had occurred. The S was as- 
sured that there would be no more 
shock and was then asked to give 
chained associations to the stimulus 
words, being allowed 15 -seconds for 
each chain. Finally S was inter- 
viewed to determine the level of 
awareness of the method of avoiding 
shock. Eleven Ss could be classified 
as having a high level of awareness 
and another 11 as having a low level 
of awareness; the latter were unable 
to state any reasons for the shock 
other than the misinformation given 
them at the beginning. The low-level 
group, however, showed a drop in 
repetitions from 42% on the second 
trial to only 13% on the seventh 
trial for the critical words compared 
with 63% and 60% respectively for 
the noncritical words; their avoid- 
ance behavior in terms of percent- 
ages was not significantly different 
from the high-level group. An addi- 
tional finding of considerable interest 
in this experiment was that although 
the low-level group showed even 
greater avoidance on the second and 
third trials than the high-level group, 
their reaction times tended to drop, 
whereas the reaction times of the 
high-level group tended to rise. Con- 
scious avoidance seemed to take 
longer than automatic, low-level 
avoidance. Both groups gave about 
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50% of the first-trial responses to 
critical stimuli during the chained 
associations; this compared with 
about 70% of the first-trial responses 
to noncritical stimuli. 

Hildum and Brown (25) conducted 
an attitude survey over the tele- 
phone, asking each S 15 questions 
concerning the Harvard philosophy 
of General Education (sic). One group 
received ‘‘Good”’ after “‘pro’’ answers, 
another ‘‘Good”’ after ‘‘anti’’ answers, 
another ‘‘Mm-hmm"” after “pro,”’ and 
a fourth ‘““Mm-hmm”’ after “‘anti.”’ 
The “Good—pro” group gave signifi- 
cantly more favorable responses than 
the ‘‘Good—anti"’ group, whereas the 
difference for the ‘‘Mm-hmm” groups 
was not even in the predicted direc- 
tion. Eight of the 20 ‘‘Good”’ Ss were 
aware that / had said ‘‘Good”’ 
whereas only one of the 20 ‘‘Mm- 
hmm’’ Ss was aware of ‘‘Mm-hmm.” 
The authors, who show considerable 
awareness of the problem of accurate 
determination of awareness, point 
out that awareness of F's behavior 
may in some way account for 
the greater effectiveness of ‘‘Good,” 
even though the denial, by all the Ss, 
that they were influenced by E be ac- 
cepted at face value. The question 
might be raised whether the Ss, who 
were Harvard undergraduate and 
graduate students, may have felt that 
they should not have been influenced ; 
if so, the denial should probably not 
be taken at face value. 

Nuthmann (51) presented Ss with 
a personality test containing 100 ac- 
ceptance-ol-self items and 40 buffer 
items. Lach item was presented on a 
card; S then pressed a ‘‘true’’ or a 
false’ switch. The 45 Ss had all had 
the test previously and had scored in 
the lowest 15% of a group of 420 stu- 
dents. In one group an acceptance- 
of-self response (which could be either 
“true” or “false,” 


depending upon 
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the item) was followed by ‘‘Good"’; 
in another group a light came on; in 
the control group no reinforcement 
was given. The 15 ‘‘Good” Ss in- 
creased from about 67% acceptance- 
of-self responses during the first 20 
critical trials to about 77% such re- 
sponses during the last 20 critical 
trials (each block contained an addi- 
tional 8 buffer-item trials), signifi- 
cantly higher than the light and con- 
trol grouns, which declined slightly 
from about 65% to about 64% ac- 
ceptance-of-self responses. The 
“Good” Ss were asked, “Did you 
think that anything I said had any- 
thing to do with your responses?” 
Five verbalized the purpose of the ex- 
periment correctly; these had _ re- 
sponded at a slightly but insignifi- 
cantly higher level throughout than 
the unaware Ss; four of the five vol- 
unteered the information that they 
were quite sure that they had not 
been influenced by the reinforcement. 
It is unfortunate that a comparison 
is not presented between the unaware 
“Good” Ss and the control Ss; the 
author points out that not all the Ss 
in the “Good” group increased in ac- 
ceptance-of-self responses over the 
five blocks. Correlated hypotheses 
are mentioned to the extent of stating 
that most of the unaware Ss verbal- 
ized a “contradictory’’ hypothesis, 
many stating that reinforcement oc- 
curred whenever their response was 
consistent with that of the previous 
testing. Differential memory effects 
may have entered in to make this 
hypothesis less contradictory than it 
seemed. Four of the “Light” Ss 
verbalized the purpose correctly, yet 
did not show a significant increase in 
acceptance-of-self responses. 

In the Rees and Israel experiment 
(58) referred to in Section I, B, Ss 
were given a training series of 15 ana- 
grams and then a critical series of 15. 
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In the critical series each anagram 
had two solutions, one being the rear- 
rangement of letters according to the 
scheme 54123. The training series 
for the 10 experimental Ss consisted 
of anagrams which had only the solu- 
tion 54123; the training series for the 
10 control Ss had various solutions. 
Six of the 10 experimental Ss had no 
idea that the solutions involved any 
regular order of rearrangement; yet 
about 90% of their solutions were 
54123, the comparable figure for the 
control group being about 50%. This 
experiment is usually discussed under 
the heading of ‘‘set’’; what seems to 
be involved is the establishment of 
visual-motor behavior without con- 
scious awareness by S, perhaps simi- 
lar to the complex visual-motor be- 
haviors involved in reading and in 
many sports. 


II]. BEHAVIOR Not INVOLVING 
STIMULUS DISCRIMINATION 


In experiments not involving stim- 
ulus discrimination, in place of the 
variable of whether S believes a cue 
is being given we have the variable of 
whether SS believes that his task is to 
discover a method of responding. 
With one exception all the experi- 
ments reviewed in this section 
pend upon learning during the experi- 
ment. 


de- 


A. Experiments in Which S Believes 
That His Task Is to Discover a 
Method of Responding 


Sidowski (61), using a procedure 
similar to that of Greenspoon (16), 
instructed Ss to say any words they 
could think of, then pointed out a 
light bulb mounted in front of S$ 
which could be made to blink, telling 
S it was his job to make the bulb 
blink as many times as possible dur- 
ing the experiment. Each plural 
word given by S was followed by a 
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blink. After 325 responses each S was 
asked, ‘‘Were you aware of the pur- 
pose of the experiment?” and ‘‘Were 
you aware of the purpose of the 
light?’” Only 3 of 13 Ss stated that 
the purpose was to condition plural 
words, and the remaining 10 showed 
an increase from about 23% plurals 
during the first 65 words to about 
35% during the last 65 words. How- 
ever, Sidowski does not discuss the 
problem of correlated hypotheses. 

Weiss (75) instructed his Ss to re- 
spond to each word from E with a 
word or phrase; S was told that E 
would say ‘‘Right’’ or ‘‘Wrong”’ ac- 
cording to whether S’s word fell into 
a certain category. The category 
chosen was that of living things. 
Neither a timed group (‘‘respond as 
quickly as you can’’) nor an untimed 
group differed significantly from a 
control group (untimed with no rein- 
forcement). The use of discrete trials 
may be an important difference from 
Sidowski’s and Greenspoon’s experi- 
ments. 


B. Experiments in Which S Does Not 
Believe That His Task Is to Discover 
a Method of Responding 


1. Response Not Learned During the 
Experiment 


Kennedy (33), replicating an ex- 
periment by Hansen and Lehmann 
(19), got Ss to whisper without 
awareness that they were doing so. 
Each S was given kinesthetic (‘‘men- 
tal shouting’’) instructions or ‘‘visu- 
alization”’ instructions; in both cases 
unconscious whispering occurred. 


2. Response Learned or Strengthened 
During the Experiment 


In the same chapter of Fundamen- 
tals of Learning referred to in Section 
I, D, Thorndike reported several ex- 
periments not involving stimulus dis- 
crimination. His basic procedure was 
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to present S with a series of slips of 
paper, asking him to estimate the 
length of each to the nearest quarter- 
inch. Certain lengths were presented 
more frequently than others. £ an- 
nounced ‘‘Right’’ or ‘‘Wrong”’ after 
each guess. A similar procedure was 
used for the judgment of areas. At 
the conclusion of his judgments, each 
S was asked whether he had thought 
that any lengths occurred more often 
than others, and, if so, which ones. 
Thorndike found that those lengths 
or areas which were right more fre- 
quently began to be given more fre- 
quently even to slips of paper of some 
other magnitude. These results, how- 
ever, can be accounted for either by 
the learning of specific cards (in some 
experiments) or by improved dis- 
crimination of magnitudes. 
Greenspoon instructed Ss to say 
all the words they could think.of, to 
say them individually, and not to use 
any sentences or phrases (16). By 
saying ‘‘mmm-hmm” after each plural 
response, during the first 25 minutes, 
he more than doubled the percentage 
of plurals for 14 Ss, compared with a 
control group which had no contin- 
gent response from FE. When ques- 
tioned at the end, none of the 14 Ss 
verbalized the relation between the 
contingent response and the response 
which it followed (one S had been 
dropped because he did verbalize the 
relation), although presumably the 
Ss were aware that / had been saying 
“mmm-hmm.”’ With another group, 
the contingent response followed non- 
plural responses, but did not increase 
them. Similarly, ‘‘huh-uh’’ decreased 
plurals but not nonplurals. Green- 
spoon points out that the differential 
effect may depend upon size and/or 
heterogeneity of the response class. 
Greenspoon does not discuss the 
problem of correlated hypotheses. 
Sidowski (61), in the same experi- 
ment referred to in Section II, A, had 
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one group of Ss who were given the 
light-blink without any mention of it 
in the instructions. Only 3 of the 19 
Ss in this group verbalized the pur- 
pose of the experiment correctly, yet 
the remaining 16 showed an increase 
in plurals from about 15% during the 
first 65 words to a maximum of about 
27% during the third block of 65 
words, significantly better than con- 
trol groups without reinforcement of 
plurals. As with the group previously 
discussed, no mention is made of the 
problem of correlated hypotheses. 

Verplanck has reported an operant 
conditioning experiment in which 
various Es increased the rate of opin- 
ion-stating of their Ss by reinforcing 
in various ways (repetition, agree- 
ment, etc.) (71). Verplanck reports 
that no S gave any evidence that he 
was ‘‘aware’”’ that he was serving asa 
subject in an experiment, that his be- 
havior was being deliberately man- 
ipulated and recorded, or that he rec- 
ognized that there was anything pe- 
culiar about the conversation. Ta 
come within the scope of behavior 
without awareness, however, as dealt 
with in this review, it would have to 
be shown that the Ss were not aware 
that their rate of stating opinions in- 
creased and/or that their opinion- 
stating was being influenced by F’s 
responses (this lack of awareness is 
not necessarily covered by the state- 
ment that they were unaware that 
their behavior was being deliberately 
manipulated). Elsewhere, Verplanck 
has indicated that behavior without 
awareness has been obtained, but has 
not yet published his results in detail 
(72).18 


16 The reviewer, while serving as S for 
Verplanck in an informal discrimination ex- 
periment with “trading’”’ cards, apparently 
began to use the color blue as a cue without 
being aware of using it; this result, which was 
quite convincing and startling to the reviewer, 
is unfortunately not easily replicated. 
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I. Behavior Involving Stim- 
ulus Discrimination 


A. S knows specific nature 
of cue 


B. Sknows general nature 
of cue 


>. S believes cue is being 
given 


. S does not believe cue 
is being given 


II. Behavior Not 
© simulus 
tion 

A. S believes task is to dis 
cover method of re- 
sponding 

. S does not believe task 
is to discover method 
of responding 


Involving 
Discrimina- 


rABLE 2 


Stupies INCLUDED IN THIs REVIEW 
Replications of preceding experiments indicated by (R); positive results by (+); negative 
results by (—); alternative explanations by (?). 


Behavior Not Learned During 
Experiment 


Peirce & Jastrow (+) 

Fullerton & Cattell (+) 

Sidis (+) 

(R) Stroh, Shaw, & Washburn 
(+) 

(R) Pillai (+) 

Coover (+-) 

Baker (+) (?) 

(R) Hilgard, Miller, & Ohlsen 
(—) 

Williams (+) 

(R) Miller (+-) 

(R) Vinacke (—) 


Dunlap (+) (?) 

(R) Titchner & Pyle (—) 
(R) Manro & Washburn (+) 
(R) Hollingworth (+-) 

(R) Bressler (+-) 

Perky (+) 

Miller (+-) 

Sidis (+-) 

Coover (?) 

(R) Collier (+-) 


Hansen & Lehmann (+) 
(R) Kennedy (+-) 





Behavior Learned or Strength- 
ened During Experiment 


Newhall & Sears (+-) (?) 

Baker (+-) (?) 

(R) Wedell, Taylor, & Skol- 
nick (—) 

(R) Hilgard, Miller, & Ohlsen 
(—) 

Lazarus & McCleary (+-) (?) 


Hull (+) (?) 

Smoke (+-) (?) 

Heidbreder (+) (?) 
Goodnow & Postman (+) (?) 


Postman & Jarrett (+-) (?) 
Philbrick & Postman (+-) (?) 


Miller (+) 

Kennedy (+-) (?) 

Thorndike (+) (?) 

Thorndike & Rock (+) (?) 

Cohen, Kalish, Thurston, & 
Cohen (+) (?) 

Klein (+) (?) 

Taffel (+) (?) 

Sears & Cohen (+) 

Cohen, Hilgard, & Wendt 
(+) 

Razran (+) 

Diven (+) (?) 

Haggard (+-) (7) 

Lacey & Smith (+-) 

Eriksen & Kuethe (+) 

Hildum & Brown (+) (?) 

Nuthmann (4+4-) (?) 


Sidowski (+-) (?) 
Weiss (—) 


Thorndike (+4-) (?) 
Greenspoon (+) (?) 
Verplanck (+) (?) 
Mandler & Kaplan (+-) (7) 
Rees & Israel (+). 
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Mandler and Kaplan (42) ran 28 


Ss with the Greenspoon procedure, 
saying ‘‘mm-hmm”’ after every plural- 
noun response during the reinforce- 
ment period. Each S gave 500 re- 
sponses, the first 100 being used to ob- 
tain the operant level, the last 200 
being extinction trials. An extended 
interview given to each S was used 
to determine the degree of awareness 
of the reinforcing contingency, on a 
rating scale of from 1 (full aware- 
ness) to 11 (complete lack of aware- 
ness). The ratings ranged from 3 to 
11 with a mean of 6. The Ss asa 
group did not show a significant in- 
crease in plural noun responses, but 
those Ss who thought (as determined 
by the interview) that “mm-hmm” 
meant they were doing all right in- 
creased from about 12% operant 
level to about 19% during the first 
reinforcement period (100 trials) 


whereas those who thought that 


‘“‘mm-hmm” meant they were not do- 
ing all right declined from about 12% 


to 7%, even though their mean 
awareness was 5.6 compared with a 
6.4 for the “positive’’ group! This 
difference between the two groups in 
plural-noun responses is significant at 
the .01 level. During the second rein- 
forcement period, however, the “‘posi- 
tive’ group fell down to about 9%, 
not significantly different from the 
‘negative’ group. The fact that 
many Ss were to some extent aware 
plus the problem of correlated hy- 
potheses makes these results difficult 
to evaluate. 


JOE K. ADAMS 


SUMMARY AND CONCLUSIONS 


Table 2 presents a classification of 
the studies on behavior without 
awareness covered by this review. 
Positive results are indicated by 
““(+-),”" negative results by ‘‘(—),” 
replications of preceding experiments 
by ‘‘(R),” and studies for which al- 
ternative explanations have been 
suggested, either by this reviewer or 
by others (in some cases the authors 
themselves) or which do not present 
conclusive evidence are indicated by 
““(?).”" If we remember that the re- 
sults in I, D tend to be small and/or 
unreplicated, then it is clear from the 
table that, contrary to a widespread 
conviction among psychologists, the 
only type of behavior without aware- 
ness which can be easily reproduced 
on the basis of published reports is 
the classical type, in which S knows 
what he is supposed to be discrim- 
inating, but does not know that he is 
discriminating, because of the ab- 
sence of the usual sensory experiences 
to which he is accustomed under the 
given type of stimulation. Neverthe- 
less we know (in some sense of the 
word) that many other kinds of be- 
havior without awareness do fre- 
quently occur, and psychologists con- 
tinue to report their demonstration 
in the laboratory. The establishment 
of conditions under which these other 
types can be unequivocally demon- 
strated by and measured by any 
competent researcher remains an in- 
teresting and important challenge to 
experimental ingenuity. 
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The observation that abnormali- 
ties of behavior are relatively fre- 
quent among the children of persons 
who themselves behave abnormally 
antedates scientific psychology by a 
number of centuries. For several 
decades, argument among scientists 
has waxed and waned as to whether 
biological inheritance or social in- 
heritance is primarily responsible for 
this phenomenon. In the nature- 
versus-nurture controversy, little no- 
tice has been paid to the fact that the 
tendency of particular abnormalities 
of behavior to “run in families’ might 
be useful to behavioral scientists re- 
gardless of whether the ultimate 
cause lies in genetic endowment or in 
environmentally determined experi- 
ence, and regardless of the magnitude 
of the correlation between social and 
biological inheritance. This potential 
usefulness lies in the fact that indi- 
viduals in a population with a known 
and relatively high incidence rate for 
a particular disorder may be sub- 
mitted to longitudinal investigation 
of a kind which would not be eco- 
nomical in samples drawn from the 
general population. For example, as- 
suming it were possible to select a 
sample of 100 infants in which it was 
known that 68 cases of schizophrenia 
would eventually develop, detailed 
comparison on a variety of measures 
over a period of years might yield 
refined concepts regarding the etiol- 
ogy and prediction of schizophrenia 
in terms of the crucial differences be- 
tween the 68 persons who ultimately 
developed the disease and the 32 who 
did not. Similarly, in a sample of this 
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kind and batting against a known ex- 
pectancy of 68%, one might evaluate 
the efficacy of any desired mental 
hygiene program in preventing schiz- 
ophrenic breakdown. In such a sam- 
ple, we believe longitudinal study of 
the kind proposed would be prac- 
ticable. By way of contrast, a ran- 
dom sample selected from the general 
population to assure an eventual total 
of 68 cases of schizophrenia for pur- 
poses of a similar investigation, would 
require the investigator to start with 
something like 8,000 infants. This 
figure is based upon an estimate of 
0085 as the incidence of schizo- 
phrenia in the general population. 
The difficulties involved in a detailed 
longitudinal investigation with this 
number of subjects are obvious, 
whether with predictive or preven- 
tive goals in view. 


THE BASE-RATES PROBLEM 


The reader may recognize in the 
foregoing an instance of the ‘‘base- 
rates’ problem. This problem and 
its implications for psychologists 
were discussed by Slater and Slater 
(22) in 1944, and elaborated by Meehl 
and Rosen (10) in 1955. In essence, 
these authors called attention to the 
fact, often neglected by investigators 
seeking to predict behavior from psy- 
chometric patterns or cutting scores, 
that the predictive efficiency of an 
instrument cannot be judged without 
knowledge of the base rate or fre- 
quency of the variable in question in 
the population where such predic- 
tions, if successful, would be of im- 
portance. They point out, for exam- 
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ple, that in seeking to predict suicides 
within a psychiatric population or 
disciplinary discharges among Army 
recruits, the psychologist and his 
tests are working in the face of odds 
which are probably well nigh unbeat- 
able. A minute percentage of ‘‘false 
positives’ in the traditional test- 
validation process, when reflected 
against the total population of psy- 
chiatric patients or against all Army 
inductees, is likely to render the psy- 
cologist wrong much more often than 
if he had stuck with the base rates 
and predicted no suicides and no 
disciplinary discharges. 

Eventually, like it or no, we will 
have to come to grips with the high 
probability that the base-rates prob- 
lem applies in the prediction of men- 
tal disorder from kind and number of 
traumatic life experiences, just as it 
applies in the case of psychometric 
prediction. Generalizations as to the 
etiology of mental disorder based 
upon a posteriori inferences from the 
life histories of clinical cases likewise 
are valid only if it can be shown that 
the incidence of mental disorder is 
higher among individuals who subse- 
quently undergo similar life experi- 
ences than among those who undergo 
different experiences. 

As Meehl and Rosen have re- 
marked, behavioral scientists rarely 
concern themselves with establishing 
base rates. Studies of various clinical 
groups are likely to report the fre- 
quency of broken homes, over-pro- 
tective mothers, perception of paren- 
tal coitus, etc., in the backgrounds of 
patients without any indication as to 
the frequency of similar ‘“‘traumatic”’ 
events in the lives of normal persons. 
The reader is generally left to sur- 
mise that such occurrences are as rare 
among normals as they are character- 
istic of patient populations. 

Studies like that of Oltman, Mc- 
Garry, and Friedman (12) in relation 
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to a single and relatively discrete be- 
havioral datum (parental deprivation 
incident to the broken home) in nor- 
mal, as well as in abnormal, groups, 
are rarities in psychological literature. 
Even less frequent are investigations 
like that of Renaud and Estess (17) 
which seek to disclose the totality of 
pathogenic experiences in the child- 
hood of normal, adequately function- 
ing persons through life-history inter- 
viewing and by other means. 

Oltman et al. found that groups of 
normals, schizophrenics, manic-de- 
pressives, and other psychotics did 
not differ significantly with respect 
to the incidence of broken-home 
backgrounds. Psychopathic person- 
alities and neurotics did tend to come 
from broken homes with slightly, but 
significantly, greater frequency than 
did the psychotic and normal per- 
sons. Renaud and Estess were im- 
pressed by the fact that the life his- 
tories of their 100 normal men re- 
vealed “as many traumatic events 
and pathogenic factors as are ordi- 
narily elicited in interviews with 
many psychiatric patients.” 

If one accepts the results of these 
studies and takes any reasonable es- 
timate of the frequency of disabling 
psychiatric abnormality in the entire 
population, he must perforce con- 
clude that in predicting life-adjust- 
ment outcomes it is safest to stick 
with the base rates. Judging from 
this evidence, the chances are that a 
child will function adequately as an 
adult whether or not he is rejected by 
his parents, mishandled in his toilet 
training, abused by his siblings, or 
whatever. 

If subsequent research supports the 
findings that psychic trauma in child- 
hood is so widespread as to render the 
early life histories of psychiatric pa- 
tients indistinguishable from those of 
a vast number of normals, it will be 
necessary to re-examine the preva- 
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lent assumption that unfortunate life 
experiences are both necessary and 
sufficient causes of ‘‘functional’’ men- 


tal disorder. It will likewise be neces-: 


sary to re-examine the rationale of 
therapies and prophylactic measures 
predicated on this assumption. 

The question as to why some indi- 
viduals emerge from the worst possi- 
ble socio-biological backgrounds to 
become and remain productive, well- 
adjusted adults, while other individ- 
uals from good backgrounds become 
severely neurotic or psychotic in the 
face of moderate or minimal stress 
can hardly be answered by methodol- 
ogy which ignores the base rates. In 
following the suggestion of Meehl and 
Rosen that we should study sub- 
populations with base rates higher 
than obtain in the sampling universe 
for relatively rare behavior abnormal- 
ities, there is much to be said for a 
closer look at the methodology and 
findings of human geneticists with re- 
spect to mental disorder. 


MODELS FOR PSYCHOGENETIC 
RESEARCH 


Huntington’s chorea, perhaps the 
first mental disorder to be thoroughly 
understood from a genetic point of 
view, supplies a paradigm for a psy- 
chogenetic approach to the base-rates 
problem, and illustrates a possible 
means for beating the base rates in 
longitudinal research on the psycho- 
metric detection of organic brain 
damage. 

Psychologists have long been in- 
trigued by the notion that tests of in- 
tellectual functioning might be de- 
vised which would be of practical 
importance in distinguishing people 
with demonstrable brain lesions from 
other people with ‘‘functional’’ im- 
pairment and from normals. To date, 
efforts to develop such measures have 
been largely disappointing, due pos- 
sibly to inherent sampling problems 
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(24). If one utilizes psychiatric pa- 
tients submitted to psychosurgery 
and compares test performances be- 
fore and after leucotomy or lobotomy, 
it is difficult to justify any far-reach- 
ing generalizations regarding frontal 
lobe functions in people who are not 
sufficiently psychotic to require such 
surgery. Cases of accidental brain 
trauma rarely accommodate an in- 
vestigator by undergoing examina- 
tion with the desired test battery 
prior to sustaining head injury. In 
military service in time of war it is 
safe to predict considerable numbers 
of brain injuries among casualties, 
but tests which can be administered 
economically to the enormous num- 
ber of recruits in which these brain 
injuries later chance to occur are not 
likely to be sensitive to the subtle in- 
tellectual changes which are believed 
of clinical importance. 

Huntington’s chorea circumvents 
these sampling difficulties rather well 
in providing a population with a 
known incidence of easily recogniza- 
ble and relatively homogeneous brain 
damage. It isa progressive degenera- 
tive disease of the central nervous 
system with an incidence (base rate) 
of about .0000543 in the general pop- 
ulation (14). Thus, if one wished, he 
might draw a sample from the uni- 
verse of infants and begin testing, se- 
cure in the knowledge that a known 
proportion of them would eventually 
develop Huntington’s chorea and 
consequently be identifiable as defi- 
nite cases of organic brain disease. 
Their rates of mental development 
and deterioration might be studied 
before and after symptoms devel- 
oped, and might be contrasted with 
those for the remainder of the sample 
who did not develop this disease. 
Among the 8,000 infants already se- 
lected for our hypothetical longi- 
tudinal study of schizophrenia, the 
chances would be only a little better 
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than three in ten that even a single 
case of Huntington's chorea would 
appear. If we wished to ensure a 
working N of 68 cases of Hunting- 
ton’s, a sample drawn at random 
from the universe of infants would 
require an initial number something 
like 1,245,421—say a million and a 
quarter in round numbers. 
Fortunately for investigative pur- 
poses, in Huntington’s chorea we are 
not dependent upon random selection 
from the general population. Neither 
must we adhere to a strictly birth to 
death follow-up to ensure measures 
of premorbid and deteriorated levels 
of functioning in victims of the dis- 
ease. By restricting our sample to 
children, one of whose parents was 
known to have Huntington's chorea, 
we immediately raise the base rate 
from .0000543 to .50000. This fact 


has been demonstrated empirically in 
repeated studies of choreic kinships 
which indicate that in Huntington's 
chorea the genetic concept of a single 


autosomal dominant gene with com- 
plete penetrance serves to predict and 
explain the observed facts. 

Second, while the onset of symp- 
toms in Huntington’s chorea has been 
observed as early as birth and as late 
as age 77, the disease usually mani- 
fests itself in middle life. The average 
age of onset is about 36, with a stand- 
ard deviation of around 12 years, 
and a distribution (allowing for prob- 
able systematic reportorial inaccura- 
cies in observed data) which approx- 
imates a normal curve (1, 14). Thus 
something like 68% of the cases will 
begin to show symptoms of Hunting- 
ton’s chorea while in the 24- to 48- 
year age range. If we were to begin 
with a sample of 100 persons of the 
same age selected at random from 
among all individuals with one chor- 
eic parent, and begin an intensive 
longitudinal study of those who ap- 
peared clinically normal at age 24, we 
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might expect within 24 years to ob- 
serve the onset of the deteriorating 
process in 34 cases. Thus, an N of 200 
subjects selected on the basis of .50 
genetic probability at birth would 
provide a working N of 68 clinical 
cases over a period of 24 years. Six- 
teen cases would be expected to ap- 
pear before the study began and six- 
teen cases after it terminated. 

Figure 1 indicates the relationship 
between age and the probability of 
subsequently developing Hunting- 
ton’s chorea for children with one 
affected parent. It may also indicate 
to the reader appalled by the thought 
of a 24-year study that judicious se- 
lection of subjects in the age groups 
most liable to development of the 
disease would provide opportunity to 
observe its onset and course in a high 
percentage of cases within a much 
shorter period. In any case, however, 
human psychogenetic research re- 
quires a temporal reorientation for 
most behavioral scientists accus- 
tomed to projects of limited duration, 
quick results, and early publication. 

Huntington’s chorea is almost 
unique among mental disorders in 
that environmental variables appear 
to be of little significance in influenc- 
ing the disease process. If the abnor- 
mal gene is present, the disease will 
manifest itself in any environment 
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capable of sustaining life, and indi- 
vidual differences as regards age of 
onset, rate of progression, neurologi- 
cal and psychological symptoms, etc., 
are seemingly more explicable in 
terms of different biotypes or genetic 
strains among afflicted: kinships than 
in terms of differences in environ- 
ment, either early or late in the vic- 
tims’ life histories (3). 

In the more common “functional” 
mental disorders which confront 
thousands of psychological investi- 
gators and practitioners in their daily 
work, the situation is somewhat dif- 
ferent. No one among contemporary 
authorities of genetic persuasion 
would deny that differences in en- 
vironment contribute heavily to be- 
havioral variance within and between 
abnormal groups. In spite of this 
fact, the extensive data on the fa- 
milial incidence of mental disorders 
do afford the basis for develop- 
ing age-specific expectancy charts 
with applications similar to those in 
our illustration with Huntington’s 
chorea. To avoid the necessity for 
detailing the reasons psychologists 
give for generally ignoring geneticists’ 
findings, we may assume that any 
tendency of mental disorders to run 
in families is attributable entirely to 
social inheritance. Without prejudice 
to the idea of preselected subpopula- 
tions, we may assume that the bio- 
logical endowment of human em- 
bryos for potential tolerance to stress 
is substantially identical, subject 
only to differing experience during 
and subsequent to parturition. 

Considering first schizophrenia, the 
example utilized at the beginning of 
this paper of a sample with an ex- 
pectancy for the disease of 68% was 
not merely hypothetical. This figure 
is based upon the finding in pedigree 
studies by Kallmann (8), that 68.1% 
of children born to matings where 
both parents were schizophrenic de- 
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veloped the disease themselves. 


Where a single parent was known to 
be schizophrenic, 16.4% of the off- 
spring developed the disease. 

Determination of age of onset in 
schizophrenia presents somewhat 
greater problems than in Hunting- 
ton’s chorea, and reliable figures are 
in short supply. For purposes of psy- 
chogenetic investigation, a major 
complicating factor is the fact that 
early onset has been numbered 
among the diagnostic criteria for 
schizophrenia. Slater (20) has pre- 
sented data on the ages at onset of 
schizophrenia among 67 siblings of 
156 propositi or index cases, thus af- 
fording somewhat greater confidence 
that a reasonable picture of the inci- 
dence in older age groups is secured 
than would a much larger number of 
schizophrenic index cases diagnosed 
by conventional procedures in usual 
psychiatric practice. 

Figure 2 reflects Slater’s.data on 
age of onset against Kallmann’s risk 
figures for schizophrenia among sub- 
jects with one and with both parents 
known to be schizophrenic. 

We must recognize possible limita- 
tions due to sampling and methodo- 
logical errors in the studies cited. 
With this qualification, it is obvious 
that if an investigator seeks by a de- 
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velopmental approach to understand 
the crucial factors which distinguish 
the social inheritance of persons who 
become schizophrenic from that of 
their siblings and others who do not, 
he can spare himself considerable 
effort by selecting his cases from 
populations where probabilities are 
high that the necessary etiologic con- 
ditions will obtain. 

The affective psychoses, likewise, 
have been shown to occur with 
greater frequency among relatives of 
affected propositi than among people 
in general. Since there is an ascer- 
tainable (though as yet inadequately 
investigated) pattern for age in rela- 
tionship to frequency of diagnosis, it 
would be quite possible to develop 
age-specific expressions of probability 
for developing an affective psychosis 
based upon empiric risk figures al- 
ready at hand for various subpopula- 
tions. 

Manic-depressive 


psychosis ap- 


pears to be somewhat less frequent 


in the general population (base rate 
about .004) than does schizophrenia. 
In spite of this fact, the economies 
effected by studying a subpopulation 
of persons, each with one manic-de- 
pressive parent, would be even 
greater than in the case of schizo- 
phrenia, in that about 23% of the 
offspring in matings with one manic- 
depressive parent will also develop 
the disease, as against the 16.4% 
morbidity in schizophrenia. 
Geneticists have sought to explain 
these differences in base rates and 
familial morbidity (along with some 
of the other interesting findings in 
these disorders) in terms of a domi- 
nant mode of inheritance in manic- 
depressive psychosis versus a reces- 
sive in schizophrenia, with the intro- 
duction of several other constructs 
such as partial penetrance and poly- 
genic modifying factors to account for 
some of the troublesome details. Per- 
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sons emphasizing the factor of social 
inheritance might seek to explain 
these differences in other terms. Per- 
haps children simply find it easier to 
identify with a cyclothymic parent 
than with a schizoid one, the former 
being more likable in their better 
moments. Possibly because manic- 
depressive disorders are believed on 
the average to have a later onset than 


‘ do the schizophrenias, children of the 


former have a longer average ex- 
posure time to the unfortunate pa- 
rental influence, and consequently a 
heightened risk of patterning their 
personalities after undesirable mod- 
els. Hypotheses could be multiplied, 
but the important thing for behav- 
ioral scientists is to examine the facts 
to see if they really do demand ex- 
planation. If so, the explanation 
should be sought empirically and ex- 
perimentally where there is a reason- 
able prospect of finding it. This could 
well be in longitudinal studies of ap- 
propriate samples with high base 
rates. 

Psychoneuroses and_ sociopathic 
personality disturbances are more 
difficult to study from the standpoint 
of inheritance than are the psy- 
choses, and have received less atten- 
tion from human geneticists, although 
the evidence from several sources is of 
interest (6, 9, 21). Traditionally, 
these conditions are regarded as 
purely psychogenic, even by the psy- 
chologists who admit the possibility 
of inherited biological predispositions 
toward psychotic modes of adjust- 
ment. Under our assumption con- 
cerning social inheritance, however, 
it seems reasonable and consistent 
with all current personality theories 
to predict that the incidence of neu- 
roticism, psychopathy, etc., will be 
relatively high among the children of 
parents with these difficulties. Ac- 
cordingly, such children should con- 
stitute an acceptable high base-rate 
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group for investigators with even the 
most rigid environmentalistic biases. 
The problem remains of determining 
base rates and age expectancies for 
these disorders. Unless this problem 
is overcome, longitudinal studies 
would be limited so far as assessment 
of mental hygiene measures is con- 
cerned. It should still be possible, 
however, to make important empiri- 
cal observations of children of neu- 
rotic and psychopathic parents, and 
possibly to demonstrate the nature of 
the crucial differences between the 
psychological environments of those 
children who later become neurotic 
or psychopathic and the children who 
later remain well adjusted, despite 
sharing the same physical environ- 
ments with their less fortunate sibs. 

Presumably, base rates for fre- 
quency of neurosis and psychopathy 
in the general population would not 
run as high as the 19.04% judged 
neurotic and 3.56% judged psycho- 
pathic by Pearson and Amacher (13) 
in a large sample of unmarried moth- 
ers. ven if the over-all base rates 
were to approach or exceed these fig- 
ures, the economies inherent in using 
samples selected because of predis- 
posing social inheritance (and/or bio- 
logical inheritance) should prove con- 
siderable. 

In the foregoing examples, the ac- 
ceptance of traditional broad diag- 
nostic categories such as ‘‘the schizo- 
phrenias,”’ ‘‘the affective psychoses,” 
and ‘‘the psychoneuroses’’ may be 
deemed objectionable by some. The 
principle underlying a psychogenetic 
approach to behavior disorders re- 
mains the same, however, even if one 
were to restrict a study to the nature 
and frequency of neuroses in female 
children of mothers with conversion 
hysteria manifested by glove anes- 
thesia of the left hand. The present 
need is for studies under broad ru- 
brics, and the categories cited above 
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are convenient starting points. 
Eventually, if the limits of prediction 
of behavior from knowledge of pa- 
rental behavior can be demonstrated, 
a psychogenetic approach might lead 
to a highly refined classificatory sys- 
tem for behavior disorders, even if a 
crisp definition like that of ‘‘normal’’ 
and ‘“‘affected’’ in Huntington’s 
chorea cannot be attained elsewhere. 


PREVIOUS CONSIDERATIONS OF 
GENETIC FACTORS IN PSYCHO- 
LOGICAL RESEARCH 


Comparative psychologists have 
long recognized the need for control 
of biological inheritance in experi- 
ments with animal behavior and the 
possibilities for utilizing strains bred 
to provide high or low base rates for 
particular behaviors. ‘‘Maze-bright’”’ 
and ‘“maze-dull” rats produced 
through genetic selection are familiar 
to almost every student of intro- 
ductory psychology, and even Hunt- 
ington’s chorea has its counterpart in 
the mouse (23, 25). The term ‘‘psy- 
chogenetics’” appears first to have 
been used by Hall (5), who decried 
the relative lack of attention to in- 
herited behavioral dispositions by 
psychologists, and who saw as a po- 
tent cause of individual differences 
within species, structures (genes) 
which can produce such varied forms 
as the flea and the elephant or the 
starfish and the kangaroo. Unfor- 
tunately, Hall foresaw little possibil- 
ity for developing a science of human 
psychogenetics, apparently feeling 
that the long span of years per human 
generation, and the notorious reluc- 
tance of people to mate in accordance 
with an experimental design, were 
effective barriers to worthwhile appli- 
cation of genetic principles. Scott 
(19), likewise, saw genetics as a tool 
in psychological research, but only in 
terms of inbred and, therefore, sub- 
human strains. 
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As indicated earlier, psychological 
research involving genetic factors in 
human personality attributes to date 
has been geared almost exclusively 
to efforts to disclose the relative po- 
tency of heredity and environment 
in determining psychological charac- 
teristics. Extensive methodologies 
have been developed, and numerous 
efforts to apply these have been re- 
corded, particularly as regards indi- 
vidual differences in human intelli- 
gence (11). Most authorities are in 
agreement that both heredity and 
environment have something to do 
with the production of such differ- 
ences, but there has been a dearth of 
suggestions as to how this conclusion 
can serve as a steppingstone to better 
prediction and control of behavior. 
The fact that there is still room for 
wide disagreement between some who 
assign a minimal role to inheritance 
and others who feel that environment 
only operates to produce differences 
within narrow limits fixed by hered- 


ity is probably due primarily to the 
uncertain factorial composition of 
human intelligence, with consequent 
uncertainty as to the contribution of 
errors of measurement to total vari- 


ance. 

In studies of nature and nurture in 
relation to emotional adjustment, the 
difficulties in defining variables and 
in constructing valid and reliable 
measures have proved even greater 
than in the case of intelligence. The 
recent application of criterion analy- 
sis to measures of neuroticism among 
twins by Eysenck and Prell (4) and 
Cattell’s suggestion (2) for use of the 
multiple-variance method in com- 
paring populations with differing de- 
grees of genetic and environmental 
homogeneity are interesting meth- 
odological advances. Both, however 
involve the difficulties generally ap- 
plying in twin studies; evidence since 
Galton’s early proposals has shown 
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twinship to be considerably less than 
the ideal provision by nature for a 
planned experiment (16). 

There is possibly an even more 
basic difficulty common to past ef- 
forts directed toward the nature-nur- 
ture problem in human_ behavior. 
This is in the proclivity of psycholo- 
gists for assuming or demonstrating 
variables to be distributed in contin- 
uous fashion throughout the general 
population, but concentrating atten- 
tion on the pathological extremes, 
which may, in fact, constitute dis- 
crete series. The familiar concept of 
a normal distribution for ‘‘emotional 
adjustment” ranging from “super 
normal” and ‘“‘normal’’ to ‘‘neurotic”’ 
and “psychotic” has led laymen and 
behavioral scientists alike to picture 
human emotions in varying shades of 
gray. Various schemata have been 
proposed to place particular malad- 
justments on continua like that of 
Richards (18) who regards schizo- 
phrenia and hysteria as opposite poles 
of an extratension-introversion con- 
tinuum. Still more specific kinds of 
behavior are treated as continuous. 
We may even regard the mother who 
projects the sensations from her own 
full bladder and insists that her young 
daughter has to go to the bathroom 
as different in degree, but not in 
kind, from a lady with the fixed false 
belief that the President has erotic 
designs upon her person. 

While such conceptualizations may 
serve a useful purpose, they may also 
be misleading. The danger lies in the 
temptation to infer continuous dis- 
tribution of underlying etiological 
factors from the fact that behavioral 
traits appear to be so distributed. 
Huntington's chorea may again il- 
lustrate the point: Symptomwise, vic- 
tims of this disease might be regarded 
as the extreme of the distribution for 
motor steadiness or manual dexterity, 
which evidence suggests are normally 








414 


distributed in the general population. 
However, regardless of the nature of 
the causes of individual differences in 
these traits among people in general, 
it is clear that the Huntington's 
chorea victims constitute a group 
apart, different not in degree, but in 
kind, by reason of some specific bio- 
chemical error which is highly pre- 
dictable in terms of inheritance, and 
which operates in a manner quite 
different from anything observed in 
nonchoreic kinships. 

The study of Eysenck and Prell 
cited earlier led these authors to con- 
clude that there is a biological factor 
of neuroticism in which biological in- 
heritance contributes about 80% of 
the variance. However, their as- 
sumption that neuroticism is on a 
continuum in the general population 
and the samples employed make it 
impossible to infer that clinical cases 
of neurosis arise at the extreme of the 
continuum only because of the degree 
to which they inherit the neuroticism 


factor. Testwise or symptomwise the 


diagnosed neurotics do constitute 
the extreme of a distribution, but 
the reasons for their coming to this 
sorry end may be quite different 
from the reasons which cause indi- 
viduals in the “borderline” or ‘‘nor- 
mal” range of test scores or clinical 
behavior to fall where they do. Just 
as Huntington’s choreics constitute a 
group apart (etiologically) on the mo- 
tor-steadiness continuum, so clinical 
neurotics might fall at the end of 
the distribution by reason of a dif- 
ferent kind of heredity, either biologi- 
cal or social. If one surveys the work 
on human inheritance, a general pat- 
tern emerges as regards etiological 
differences between normal and path- 
ological degrees of variability. In 
genetic terms, normal variability of 
hereditary characteristics within a 
species is' explained on the basis of 
polygenic inheritance. People who 
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are very tall or very short, but still 
within the “‘normal’”’ range, appear to 
manifest the influence of many genes 
or gene pairs, each with small but 
similar effect. At both extremes of 
the distribution for height, however, 
observations indicate that a number 
of hereditary pathological syndromes 
are determined by single genes or 
gene pairs of very potent influence in 
determining the metabolic processes 
of the entire organism. Chondrodys- 
trophic dwarfism has been shown to 
be transmitted as a dominant in a 
number of kinships, as has its symp- 
tomatic inversion, arachnodactyly 
(9). At least some forms of gigan- 
tism operate as recessive characters 
(7). f 

With respect to psychological char- 
acteristics, some direct evidence is 
provided in the area of intelligence. 
Again, from the genetic point of view, 
studies showing high parent-child 
correlations on measures of intelli- 
gence throughout the normal range 
are interpreted as indicating that nu- 
merous gene pairs are involved in the 
production of these individual differ- 
ences. At the low extreme of the dis- 
tribution, however, the existence of 
several etiological agents transmitted 
after the fashion of single genes or 
gene pairs has been demonstrated. 
Among congenital varieties of mental 
deficiency, microcephaly, Tay-Sachs 
disease, or infantile amaurotic idiocy, 
and several other syndromes have ap- 
peared as recessive characteristics in 
numerous kinships, while others such 
as epiloia behave as dominants. De- 
menting conditions in which indi- 
viduals seemingly normal for a time 
after parturition deteriorate intel- 
lectually and develop the appearance 
of mental deficiency weeks, months, 
or years following birth likewise may 
show either dominant or recessive in- 
heritance. Phenylpyruvic acid oligo- 
phrenia, juvenile amaurotic idiocy, 
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Alzheimer’s disease, and, of course, 
Huntington’s chorea are prime ex- 
amples (15). The writers are even of 
the opinion that at the extreme upper 
end of the distribution for intelli- 
gence, as well, single-factor biological 
inheritance may possibly account for 
some of the gross anomalies observed 
from time to time. 

The inference that the pathological 
extremes of the “emotional adjust- 
ment continuum’ may be deter- 
mined by unitary factors of biological 
inheritance because this seems to be 
the case with other physical and men- 
tal traits of humans is provocative. 
However, as in the case of other anal- 
ogies, it may not be regarded as 
proof. To do so would be an error of 
the same sort as is commonly com- 
mitted in inferring etiological factors 
for maladjustment to be distributed 
along a continuum from the fact that 
symptomatic behaviors appear to be 
so distributed. However, when the 


weight of evidence from direct genet- 


ic studies of emotional disorder is 
thrown into the balance along with 
the suggestive analogy above, the 
possibility appears very real that the 
individuals who become ill with any 
of several ‘‘functional’’ disorders are, 
in fact, discrete populations, bio- 
logically predisposed by unitary in- 
herited factors toward particular 
kinds of abnormal behavior now gen- 
erally regarded as arising from an un- 
fortunate kind or number of learning 
experiences. 

The writers’ bias in emphasis on 
biological inheritance is by now ap- 
parent. The fact remains, however, 
as stated at the outset, that the longi- 
tudinal study of specified subpopula- 
tions is equally consonant with an 
environmental bias, to say nothing of 
an open-minded approach. The base- 
rates problem in regard to etiology of 
mental illness is a very real and pres- 
ent one for psychological science. So 
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is the possibility that the presence of 
the single anomalous molecule of pro- 
tein which we term an abnormal gene 
may override all other considerations 
in the prediction of abnormal be- 
havior. If the mainstream of psycho- 
logical thought continues to ignore 
these problems, the result might be 
the loss of an opportunity to fore- 
shorten by years the search for truly 
effective means of preventing and 
treating mental illness. Beyond this, 
if these problems are rejected as not 
within our province, the result might 
be the abdication by psychology of its 
cherished role as a leader in objective- 
ly oriented research among the be- 
havioral sciences. 


PRACTICAL PROBLEMS IN HUMAN 
PsvCHOGENETIC RESEARCH 


In the foregoing section, the sug- 
gestion is implicit that mental dis- 
order is in large part a genetic and, 
therefore, ultimately a biochemical 
problem. Assuming this to be true, 
the question immediately arises, 
“Why not leave it to the geneticists 
and biochemists to work it out, and 
let behavioral scientists struggle 
along with present methods for deal- 
ing with these real-life problems until 
something better can be devised?” 
The answers to this question are sev- 
eral. Probably foremost is the fact 
that scientists have or should have an 
interest in being right, as well as in 
being effective. A given theory may 
do fairly well in subsuming observa- 
tions and enabling predictions, but 
it is the mark of the scientist not to 
rest until a better system is devised. 
Certainly the present status of pre- 
diction and control of human behav- 
ior disorders allows little room for 
satisfaction among its serious stu- 
dents, let alone smugness. 

A second reason why psychology 
should concern itself with psycho- 
genetics in relation to mental disorder 
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lies in the relative numbers and loca- 
tions of persons with requisite train- 
ing in either psychological or génetic 
research methodology. Persons with 
graduate-level training in human 
genetics are as yet few in number, 
and tend to find employment almost 
exclusively in university settings. 
Psychologists, on the other hand, are 
relatively numerous and abound in 
clinics and hospitals where the index 
cases necessary to psychogenetic in- 
vestigation present themselves. 

A third factor favoring hybridiza- 
tion of psychological and genetic sci- 
ences in this area is the positive con- 
tribution which psychology can make 
in speeding the progress of work from 
a strictly genetic point of view. Pro- 
cedures for establishing and main- 
taining rapport with human subjects 
and principles of communication with 
persons lacking in, intelligence and 
not familiar with sclentific jargon are 
two of several areas in which geneti- 
cists might profit from the experience 
of psychologists. 

Granting that increased cross fer- 
tilization between psychology and 
genetics is desirable in the conception 
of research on mental disorders, there 
remain several questions as to the 
practicality of longitudinal investiga- 
tions as proposed earlier. Realisti- 
cally, what are the chances that a 
population of infants could be identi- 
fied as each having two schizophrenic 
parents? The chances are, perhaps, 
much better than they appear at first 
glance. If investigators in various 
locations were alerted to the potential 
importance of such subjects in subse- 
quent research, even the seemingly 
irreducible incidence of matings by 
schizophrenics within the best-run 
mental hospitals across the country in 
a single year would probably provide 
a sample large enough to be reckoned 
with when one considers the base 
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rate or expectancy of 68%. Subjects 
selected on this basis would, almost 
without exception, be placed in adop- 
tive homes. They could be compared 
with other adopted children of non- 
schizophrenic parents placed by the 
same agencies, to give an indication 
of the importance of adoptive place- 
ment per se in the etiology of schizo- 
phrenia or such other disorders as 
might appear in the two groups. 

The morbidity expectancies for 
other subpopulations discussed ear- 
lier would be lower, but, by the same 
token, the subjects would be more 
readily available. Children with one 
normal and one schizophrenic or 
manic-depressive parent would pre- 
sent no particular problems. Recog- 
nized matings of two psychopaths 
(again with a high proportion of off- 
spring placed in adoptive homes) are 
not uncommon, and there is quite 
possibly an affinity of male neurotics 
for female neurotics, which could be 
turned to account. In any case, even 
if much more restrictive diagnostic 
categories were employed, the inci- 
dence of usable cases would probably 
be many times greater than that of 
monozygotic twins reared in dissimi- 
lar environments—a kind of subject 
which, by virtue of its rarity, has 
proved the stumbling block in many 
well-conceived experiments on genet- 
ic factors in human behavior. 

Given the names and addresses of a 
sizable list of persons selected for a 
particular psychogenetic study, what 
would be the chances of enlisting the 
cooperation of all the interested par- 
ties necessary for repeated submis- 
sion to psychometric and other stud- 
ies over a period of years? Again, the 
possibilities are probably not as re- 
mote as they may appear. Terman’s 
studies of genius have shown that 
subjects in longitudinal studies will 
remain in contact with an investiga- 
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tor for years, expend a great deal of 
effort, and divulge all sorts of highly 
personal information with no reward 
other than the satisfaction of being 
singled out for study as intellectually 
gifted. Our own work in Hunting- 
ton’s chorea indicates that a large 
majority of persons with 50% prob- 
ability for developing the disease are 
willing to participate in efforts to 
predict and control the disease. Over 
200 subjects have evidenced this will- 
ingness in submitting to psychological 
and electroencephalographic exam- 
ination on one or more occasions. 
Many of them have traveled hundreds 
of miles at their own expense to do so, 
despite foreknowledge that the re- 
sults of the study were likely to be of 
no direct or immediate benefit to 


them or their families. 

This experience may have no im- 
plications for similar approaches to 
“functional” disorders where the fa- 
milial morbidity is lower and where 
the exact nature of the etiology is in 


doubt. However, this is no basis for 
rejecting the approach without try- 
ing it. A number of authorities ques- 
tioned on a priori grounds the feasi- 
bility of informing large numbers of 
choreic families about the inheritance 
of the disease and of inviting their 
participation in predictive studies 
only to have the fears proved ground- 
less when an effort was made. The 
same finding might very well obtain 
in populations selected to show a high 
incidence of schizophrenia or other 
disorders if a frank and simple pres- 
entation were made of the nature 
and purpose of the study. 

To judge from our experience with 
persons under the dire threat of 
Huntington’s chorea, the picture of 
human personality (even the per- 
sonality one might expect to encoun- 
ter among a group of preschizo- 
phrenic or preneurotic individuals) 


as a fragile thing likely to be precipi- 
tated into catastrophic reaction by 
the intimation that emotional disor- 
ders may run in one’s family, is a 
specter which exists in the minds of 
behavioral scientists and not, as a 
rule, among the persons most directly 
concerned, Instead, one might ex- 
pect to find a certain toughness of 
psyche and a desire to face facts, 
coupled with the feeling that such au- 
thorities in the field of mental health 
as they may have consulted have not 
been completely (a) informed or (6) 
honest concerning the etiology of 
emotional disturbances. 

The recent rise of quiz shows, 
opinion polls, and surveys, together 
with the generally heightened public 
awareness of the importance of scien- 
tific research, appear to have created 
an atmosphere conducive to coop- 
eration in any investigation under 
legitimate sponsorship, even if it in- 
volves the sort of data collected in the 
Kinsey studies. Asking laymen to 
take voluntarily a series of personal- 
ity tests, or to have their brain waves 
recorded for the sake of science, is 
now likely to evoke general accept- 
ance of a sort rare only a few years 
ago except among college sopho- 
mores. The limits of this public 
warming toward scientific investiga- 
tion have not been exploited, nor, 
perhaps, should they be. The fact 
remains, however, that one cannot as- 
sess the reaction of the popu! ation in- 
volved toward a psychogenetic re- 
search design until he tests it. In 
Huntington's chorea, which cer- 
tain respects constitutes an acid test, 
the reactions have been favorable. 

Possibilities also exist for utilizing 
a psychogenetic approach to the 
study of human emotions without 
ever divulging to the subjects the fact 
that they were special objects of 
study. In Cattell’s efforts to apply 
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the multiple-variance method, for 
example, it was proposed to test the 
entire school grades in which selected 
subjects such as twins reared by 
different foster parents chanced to be 
placed.! Thus children, parents, and 
school authorities would know at most 
that a particular classroom was under 
consideration, without further iden- 
tification of the desired subjects ex- 
cept by the investigator. 

In short, the practical obstacles to 
psychogenetic research in terms of 
availability of subjects do not appear 
insurmountable. As indicated earlier, 
patience and forbearance are prereq- 
uisite virtues in investigators seek- 
ing to use this approach. Financial 
support is not easily obtained for pro- 
posals which reckon in decades rather 
than in months the interval before 
results will be forthcoming. Again, 
however, such problems should not 
prove insurmountable. Probably the 
immediate need is for additional 


studies which would verify or modify 
the base-rate expectancies and ages 


of onset for different disorders in 
different subpopulations. These could 
be completed in a relatively short 
period if any considerable number of 
psychologists or other behavioral sci- 
entists were persuaded to undertake 
them. 

In the longitudinal study of samples 
drawn from such subpopulations, co- 
operative effort by a number of in- 
vestigators, each studying a few in- 
dividuals spread across the country 
might prove a successful alternative 
to a study conducted by a single re- 
search center. It is obvious that a 
multidisciplinary approach should 
prove most fruitful in such research. 
For example, if someone went to the 
trouble of making arrangements nec- 
essary for the study of 100 infants 

! Cattell, R. B. 
1954. 


Personal communication, 
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with 68% expectancy for schizo- 
phrenia, the experimental design 
should not be restricted to intelli- 
gence and personality testing at 
specified intervals. Rather, the best 
available talent in anthropology, 
sociology, biochemistry, neurophysi- 
ology, biophysics, and other dis- 
ciplines should be focused on such a 
unique opportunity. The question as 
to who will take the lead in creating 
such opportunities remains open. 


SUMMARY 


The long recognized tendency of 
many abnormal behavior patterns to 
run in families may prove useful in 
understanding, predicting, and con- 
trolling such behavior, regardless of 
whether the ultimate cause lies in 
social heredity or in biological hered- 
ity, and regardless of the extent to 
which these are correlated. Studies 
by human geneticists and others pro- 
vide data which make it possible to 
calculate age-specific statements of 
probability for developing various 
disorders in various subpopulations 
defined in terms of genetic relation- 
ship to index cases. These subpopula- 
tions might be studied longitudinally 
in efforts to disclose the crucial differ- 
ences in heredity (either biological or 
social) between subjects who develop 
the disorders in question and those 
who do not. Studies of such subpopu- 
lations would probably prove much 
more economical than would investi- 
gations in random samples from the 
general population, by reason of the 
much higher base rates for incidence 
of the specified disease in the former. 
Preventive or mental hygiene meas- 
ures could also be evaluated economi- 
cally in longitudina! efforts with sub- 
populations having known base rates, 
The base-rates problem is discussed 
with respect to predicting psychiatric 
illness from kind and number of 








GENETIC EXPECTANCIES AS BASE RATES 


traumatic life experiences, and it is 
suggested that this problem has been 
grossly neglected in behavioral sci- 
ence. Examples of genetic expect- 
ancies as age-specific base rates for 
Huntington’s chorea and _ schizo- 
phrenia are presented and the pos- 
sibilities for similar expressions in 
other disorders are discussed. Much 


previous psychological research on 
the nature-nurture question may in- 
volve the erroneous assumption that 
because behavioral variables appear 
to be continuously distributed in the 
general population, the underlying 
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etiological factors are also thus dis- 
tributed. Examples are cited to show 
that from the etiological standpoint 
the extremes of distributions for 
several physiological and psychologi- 
cal characteristics actually consti- 
tute discrete series. Practical prob- 
lems with respect to psychogenetic 
research on human behavior disorders 
are discussed. It is recommended 
that psychologists and others in- 
terested in human behavior give 
further consideration to possibilities 
in this area. 
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THE -ILES THAT PLAGUE ELEMENTARY STATISTICS 
HORACE B. ENGLISH 


Ohio State 


Every instructor who attempts to 
instill a few notions of elementary sta- 
tistics in his students knows how hard 
it is to gain acceptance for the desig- 
nation of the top or best as the fourth 
quartile. After all, the student has 
been accustomed most of his life to 
speaking of the best quarter of his 
class as the first. ‘“‘A quartile is-just a 
quarter, isn’t it?” 

It is unfortunate that the statisti- 
cians who initiated the numerical 
designation for quartiles and other 
partiles did not think of possible con- 
fusion with a common speech usage; 
but the ranking of all the partiles 
from the bottom up is now too well 
established in technical usage to be 
changed. Rationalization of the 
names and numerical designations of 


the several concepts involved in di- 
viding a ranked distribution is, how- 
ever, highly desirable in the interests 


of easy communication. In working 
through the tangle for the Dictionary 
of Psychological and Psychoanalytic 
Terms,' I developed a schema that is 
relatively simple and self-consistent, 
and that puts a minimum strain on 
established technical usage or on 
ordinary habits of thought—the last 
being a most important criterion for 
good terminology. 

A partile is defined as “the generic 
name for one of the set of points that 
divide a serially ordered or ranked 
distribution into a number of divi- 
sions, each of which contains the same 
number of scores. 
cated as coinciding with the obtained 
score in the distribution below which 
the required fraction of the obtained 
scores is found. If there are 100 such 


1 Longmans Green, New York, 1958. 


Each point is lo- - 


University 


divisions (each containing 1/100 of 
the scores), the division points are 
called percentiles (sometimes centiles, 
but this usage, though logical, con- 
flicts with another established usage). 
Other partile points are named by 
adding -ile to the root of the Latin 
ordinal number: e.g., decile (setting 
off divisions of 1/10 of the cases), 
octile (1/8), sextile (1/6), quintile 
(1/5), quartile (1/4), tertile (1/3).” 

Note that partile is defined as a 
point. This has always been the us- 
age of careful writers, but the intro- 
duction of percentile for the more logi- 
cal centile to designate the dividing 
point left centile with no job. It 
quickly took over the task of naming 
a hundredth part of a ranked distri- 
bution; and this sort of designation, 
extended downward, led to naming 
the tenth part a decile, the fourth 
part a quartile, etc. 

The lower ranking -ile terms 
tertiles, deciles, etc.—were thus bur- 
dened with a double task: to name 
the dividing point but also the parts 
separated by the points. This does 
not seem too confusing until we note 
that, in the case of the quartile, e.g., 
the top quartile division is the fourth 
while the top quartile point is the 
third. A good mathematical mind 
says “Of course’ but the average 
student says “‘How come?’ Move 
up, moreover, to the higher partiles 
and ask even a good statistical stu- 
dent where the 95th centile division 
is—whether it is fourth, fifth, or sixth 
from the top. He won't be able to 
answer quickly; and errors in desig- 
nation, resulting in misrepresenta- 
tion of the facts, are not uncommon. 

It won't avoid all confusion but it 
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if we follow best 


the -iles when 


minimize it 
usage and restrict 
written without qualifiers to the 
dividing points. (The only exception 
to this rule is centile and even that 
might well be avoided as shown be- 
low.) 

How then shall we name the divi- 
sions or parts set off by the partile 
points? My own preference is simple. 
What are the two parts divided by 
the median? Even in the most techni- 
cal tomes, they are halves. Why not 
then thirds, fourths, sixths, tenths, 
hundredths? If the context makes 
clear that we are dealing with a 
ranked distribution, these are wholly 
unambiguous. But for those who 
prefer a term that looks a little more 
technical, tertile division, decile divi- 
sion, etc., are also unambiguous. But 
just plain decile is not. If we call the 
division a hundredth, we get rid of 
the illogicality of centile. If we must 
use an -tle term, we should parallel 
tertile division with centile division, 


will 


not just centile. 

A third concept is involved: the 
range of scores in any division. This 
has been called the partile interval 
(quartile, quintile, etc., interval) and 
is clear. 

There remains finally the question 


of ranks. What about such terms as 
decile rank, percentile (or is it centile) 


HORACE B. 


ENGLISH 


rank? Of course, the designating 
number given to either the dividing 
point or to the division is its rank. 
The rank of quartile three (the point 
divider) is 3. The rank of the top 
percentile is 99. 

But rank in this connection is com- 
monly used to assign a rank order to 
the individual: to a score, a person, a 
particular item. And this is done by 
assigning to this individual the rank 
of the partile division in which the in- 
dividual is found. Thus the rank of 
an individual is accurately given as 
quartile division rank, and as centile 
division rank (not percentile rank, 
which introduces not only semantic 
but numerical error). This may be 
expressed elliptically as quartile rank 
or centile rank but we must remember 
it is the rank of the division, not of 
the dividing point. 

All of this will seem to the working 
statistician to belabor the obvious. 
It is not the obvious for the beginning 
student. Nor is it simple. It is not 
as simple a scheme as one could de- 
vise if we were beginning afresh. A 
scientific terminology usually grows 
by accretion, often in peculiar direc- 
tions. But this scheme seems to put 
as much clarity and rationality into 
the pattern as current usage permits. 
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CARTWRIGHT 


University of Chicago 


In a recent article in this journal 
(4), Schaeffer and Levitt have pro- 
vided a valuable survey of procedures 
for Kendall's r-coefficient rank- 
correlation (3). One feature of their 
presentation, however, may cause 
readers to employ significance tests 
yielding spurious results. 

They define the r-coefficient thus 
(4, p. 339): 


ol 


S 


n—1)/2 


T=— {1 ] 
n( 

They state that when there are ties 
Eq. 1 may also be used when agree- 
ment with an untied objective rank- 
ing is being determined (4, p. 340). 
They continue (4, p. 340): “In gen- 
eral, however, the corrected formula 
should be used since rank correlations 
are usually computed when agree- 
ment rather than accuracy is the 


1 
var S= (nin —1)(2n+: 
18 


1 


9n(n—1)(n : 


1 = 
1 (5 wn) 
2n(n—1) 2X . 
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issue.”’ For the corrected formula 
they specify (4, p. 339 


S 


2-V 


T= 
V/n(n—1) 


-/n(n—-1)/2—U 


1 This note was prepared in connection with 
research supported in part by funds from the 
Ford Foundation (Psychotherapy arch 
Fund), and in part by a grant (PHS M 903 
from the National Institute of Mental He ilth, 
of the National Institutes of Health, Public 
Health Service. 
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( >. v(v—1)(v—2) 
= uu-1)), 


where 


1 
> ys v(v—1) 


1 
| = " 2 (u—1). 


In their footnote number 5 (4, p. 
341) the authors present the basic 
equation and several derivative equa- 
tions for the variance of r when there 
are ties. ‘They state that these for- 
mulas are to be found in Kendall (3). 
In fact, they do not appear in the 
form in Kendall's discussion. 
The latter gives expressions for the 
variance of S, the numerator of Eq.1 
and Eq. 2. 

Kendall’s basic equation is (3, pp. 


43, 57-60) 


same 


~1)(204+5)— >> u(u—1)(2u+-5) 


)( z: u(u—1)(u~2)) 


[3] 


The basic equation and the deriva- 
tive equations for the variance of r 
as given by Schaeffer and Levitt (4, 
p. 341) may be expressed as 


4 
: --var S, 
n*(n—1)? 


9 


[4] 


where var S is given by Eq. 3. The 
first term on the right of Eq. 4 is 
equal to the reciprocal of the de- 
nominator of Eq. 1 squared. Now, 
letting D denote the denominator of 
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a formula for r when there are ties, 
we may write 


a,?=var S/D?, (5) 


which is true for all formulas, given 
only that D is in fact the denominator 
for the particular r whose variance is 
to be computed. But Eq. 4, and all 
variance expressions given by Schaef- 
fer and Levitt (4, p. 341) are applica- 
ble only to Eq. 1, not to the general 
formula for ties Eq. 2. 

Moreover, in discussing corrections 
for continuity on S (C,) when there 
are ties (4, pp. 341-342), the authors 
recommend that C, be divided by 
n(n—1)/2 in order to obtain the cor- 
rection for rT. 

Now, we may write 


C,=C,/D, (6) 


which is true for all formulas for r 
when there are ties, given only that 
D is in fact the denominator for the 
particular rt for which C, is to be com- 
puted. But the division recom- 
mended by Schaeffer and Levitt is 
applicable only to Eq. 1, not to the 
general formula Eq. 2. 

Let us now examine a case illustrat- 
ing the kind of error that may result 
from failure to apply Eq. 5 and Eq. 6 
correctly. Eighteen boys and 16 girls 
are ranked on composite letter grades 
over a series of essays. The general 
formula (Eq. 2) is used to measure the 
association, if any, between sex and 
grades. 


We have: 
n= 34 
S= 106 
V =273 
U=59 
D= 380-235 


} 
| 
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We also find 
var S = 3261.7754, 
oe= 57.11, 
C,= (68-1-1) /26 = 2.54, 
[Whitfield’s correction (5), | 
S corrected = S,= 106-2.54= 103.46. 


Then, using the normal probability 
integral (3, p. 143) we find 


S. 103.46 
— = —_—_—_— o= 1.810; 


p<.08. 

o, 457.11 

If now we employ the correction 

and variance expressions given by 
Schaeffer and Levitt, we find 


a,?=.010364, 
o,=.1018, 
c,=5.08/1122 = .0045, 
r corrected = r,=.279—.0045 =.2745, 


and 


Te. «2745 
—=—— g=2.700; - p<.008. 
a, .1018 


The discrepancy between p<.08 
for S, and p <.008 for 7, is due solely 
to our failure to apply Eq. 5 and Eq. 
6 correctly. Our results fail to satisfy 
the identity 


[7] 


which must hold if it is accepted that 
a given result has one and only one 
p value. 

Now, if we use Eq. 1 to determine 
7 for this example, and continue to 
use the correction and variance ex- 
pressions given by Schaeffer and 
Levitt, we find 
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7T=.1889, 
T.=.1844, 


and 


7, .1844 


o, .1018 


=1.8le; p<.08. 
This result satisfies Eq. 7. 
Alternatively, if we use Eq. 2 as be- 
fore but compute the variance of r 
by Eq. 5 and the correction by Eq. 6, 
we have 
r=.279, 
a,?= .02256, 
a, = .1502, 
C,.= .0067, 
tT, = .279— .0067 = .2723, 


S.. wees 
—=——-¢g=1.8le; p<.08. 
o, .1502 
This result also satisfies Eq. 7. 
As Schaeffer and Levitt point out, 
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the formulas for ties in r are compli- 
cated; but not more so than those for 
rho when the latter is properly em- 
ployed. The authors discuss several 
advantages of r over rho. A serious 
disadvantage has been the computa- 
tional labor in obtaining S, especially 
when » is large. However, reasonably 
rapid procedures have now been de- 


-veloped by Bright (1) and by Cart- 


wright (2). In the latter article it is 
also pointed out that it is not neces- 
sary to transform raw scores to ranks 
when computing S, which provides a 
turther advantage over rho. 

In testing the significance of an ob- 
tained r when there are ties, it is prob- 
ably best to follow the routine of first 
computing var S by Eq. 3 or by the 
derivatives of Eq. 3 for different con- 
ditions of ties as given by Kendall (3, 
pp. 43-45). The quantity C, may 
then be computed by the methods 
described by Schaeffer and Levitt 
(4, pp. 341-342). Using the appro- 
priate D, 0, and C, may then be com- 
puted by Eq. 5 and Eq. 6, respec- 
tively. 
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COMMENT ON A DISTRIBUTION-FREE 
FACTORIAL-DESIGN ANALYSIS 
FRED D. SHEFFIELD 
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In a recent article, Wilson (4) has 
proposed a distribution-free method 
for analyzing the results of factorial- 
design experiments. The ideal case 
involves running an equal number of 
subjects in each cell and dichotomiz- 
ing at the over-all median of the 
quantitative scores obtained. The 
values analyzed are the frequencies 
above and below the over-all median. 
Subsequently Alluisi (1) has pre- 
sented computational formulas for 
Wilson’s method. 

Perhaps because the revised cell 
values are frequencies, Wilson and 
Alluisi used a x’ approach in their anal- 
ysis. With this approach they ad- 
duce a new set of formulas for coping 
with factorial design in terms of x?. 
They could, however, have treated the 
frequencies as scores and used the al- 
ready well-known procedures of anal- 
ysis of variance. The main purpose 
of the present comment is to reinter- 
pret Wilson’s method in more familiar 
terms. This purpose seems salutary 
if only to simplify the problems of the 
teacher and student of statistics, a 
field which has become overly com- 
plex and specialized in recent years. 
If a studerit knows analysis of vari- 
ance he can handle this special case 
without recourse to Wilson’s elabora- 
tion of x? formulas. 

In the ideal case of the Wilson 
method, the null hypothesis is that 
each observation in a cell has a 50-50 
chance of falling above the over-all 
median. If N is the number of ob- 
servations per cell, then the range of 
possible frequencies above the me- 
dian is from 0 to N and the mean is 
N/2. As is well known, the variance 
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of a frequency is NPQ, which in the 
present instance is N(.5)(.5)=WN/4, 
since the null hypothesis is that 
P=Q=.5 cases above the median. 
This value of N/4 is therefore the 
within-cell variance. Moreover, it is 
a theoretically given within-cell vari- 
ance, as if the degrees of freedom for 
its “estimation” were infinite. 

From this point on the analysis of 
variance of the frequency scores can 
proceed in the usual way. This is ex- 
emplified below with the illustrative 
data used by Wilson. These data in- 
volve a 3X3 factorial design with 16 
cases per cell. Thus the range of cell 
scores is from 0 to 16, the mean cell 
score is 8, and the variance of cell 
scores on the null hypothesis is 
16(1/4), or 4. The obtained cell 
scores in Wilson’s example are shown 
in Table 1. It should be understood 


TABLE 1 


Illumination 





| 
| 
| 


el 2 3 | Total 

A a 8 ww 37 

Dials B 9 7 8 | 24 
C Ths 3 2 |. 11 

Total | 29 #22 «21 72 


| 
=. BD Sinn 
that while these cell values are fre- 
quencies above the median they are 
nevertheless scores in the same sense 
that the yield of a plot is a score in 
the agricultural experiments used as 
illustrations by R. A. Fisher (3). 
They differ in being less continuous, 
but are little more ‘‘rounded”’ than if 
one had coded a continuous distribu- 
tion into about 17 class intervals. 
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Most students familiar with ortho- 
dox analysis of variance will be able 
to follow the analysis of Wilson's data 
shown below. The analysis should be 
regarded as one in which there is 
only one score per cell but in which 
there is an a priori within-cell vari- 
ance. The computations below work 
from marginal totals rather than 
marginal means, but this is the usual 
approach in most texts which show 
computational methods for analysis 
of variance (e.g., 2). 


Total 2 Squares: 
= (196+ .44+4121+481+449+ 64+ 36 
(72)? 
+9+44)— 
) 
128 
2 Squares for Dials: 
[(37)?+(24)?+(11)?] (72)? 
rc 3 9 
= 112.67 


> Squares for Illumination: 


[(29)?+ (22)?+4(21)*] 

= 3 
= 12.67 

2 Squares for Interaction: 


= 128— 112.67 —12.67 
= 2.67 


Since the error term is 4, the above 
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> squares give rise to the analysis of 
variance in Table 2. The implications 
of these F tests are exactly the same 
as those of Wilson’s x? analysis. In 
fact, since x? divided by its df is dis- 
tributed the same as F for infinite df 
in the smaller variance, the present 
F values can be transformed into 
Wilson’s x? values by multiplying F 
by df (except for some minor calcula- 
tion errors in Wilson’s paper). 

Thus far the present comments 
have been oriented toward a transla- 
tion of Wilson’s method into conven- 
tional analysis of variance. From this 
standpoint the above table is as far 
as analysis of variance can be carried 
with the nonparametric assumption 
that the within-cell variance is NPQ 

4. Whether the above F tests are 
made or whether Wilson's x? analysis 
is made, a single theoretical variance 
is the only error term possible, and 
all marginals and interactions must 
be tested against this single (within- 
cell) error term. Thus it is not possi- 
ble to compare the mean squares of 
the two marginals with each other or 
with the Interaction mean square 
since these latter are empirical, i.e., 
parametric, values. This represents 
a fairly important limitation on what 
can be done with the nonparametric 
approach. 

The importance of this limitation is 
apparent in the above data, in which 
F for Illumination is not at all sig- 
nificant by the nonparametric test 
but would be well within the 5 per 
cent level if tested in the conventional 


TABLE 2 


Squares df 
Dials 112.67 
Illumination 12.67 
Interaction 2.67 
Total 
(between cells) 


128.00 


Mean Square 
34 
34 


. 
6] 


00 
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way. Thus in a typical 3 X3 factorial 
design experiment with only one 
score per cell there is no within-cell 
error because of the lack of replica- 
tions. The only error term available 
in such a case is the Interaction of 
the two marginal variables. If this 
parametric approach is applied 
above, the F for Illumination against 
interaction is 6.34/0.67, or about 9.5, 
which is well beyond the 6.94 needed 
at the 5 per cent level for 2 and 4 df, 
respectively. The corresponding non- 
parametric test (F=1.58) does not 
even reach the 20 per cent level of 
confidence. 

The reason for this gross discrep- 
ancy in p values is readily apparent. 
The over-all F by the nonparametric 
test immediately disproves the null 
hypothesis and makes it not only an 
insensitive but also an untenable as- 
sumption against which to test for 
possible effects of a relatively weak 
marginal. The effects of Dials hap- 
pens to be a strong one which holds 
up for all three Illumination settings 
and is significant even with the non- 
parametric test. This implies that 
the Illumination groups are stratified 
on an important variable, and while 
the Illumination means (or totals) 
are fairly close together, they differ 
more than chance allows for samples 
stratified on such an important vari- 
able. Another way of saying the same 
thing is that whereas the Illumina- 
tion effects are small, they tend to 
hold up under all Dials conditions. 
Hence the significant F for Illumina- 
tion against Interaction. The import 
is that the nonparametric approach 
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will detect large effects but is rela- 
tively insensitive to small effects 
which are demonstrated by the data 
but which are not detected by the 


‘false null hypothesis. 


In concluding this comment it 
should be noted that the Wilson 
method actually involves two sep- 
parate parts. One part is a procedure 
for creating an approximately normal 
set of scores from an originally non- 
normal set. This part of the method 
consists of dichotomizing at the 
median (or some equally reasonable 
cutting point) and making a score out 
of the number of values falling above 
this cutting point. The second part 
is a procedure for testing obtained 
variances against the theoretical vari- 
ance implied by the cutting point 
used (i.e., ¢? = NPQ). Only the second 
part of the method is the distribution- 
free part, and it suffers from the lim- 
itation that if the nonparametric hy- 
pothesis is proven false for the data 
as a whole, there is no proper method 
of testing the individual factors with- 
out recourse to parametric methods. 
A possible compromise is to use the 
nonparametric, error term to test 
over-all F but to resort to parametric 
error terms if the over-all F disproves 
the original null hypothesis. This 
compromise is not possible within the 
framework of Wilson’s x? approach, 
which permits only comparisons be- 
tween empirical variances and the 
theoretical variance. It is naturally 
suggested, however, by the present 
reinterpretation of Wilson’s method 
in terms of conventional analysis of 
variance. 
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Recently there have been at least 
three different book reviewers who 
have commented on the confusion 
that currently exists in the psycho- 
logical literature regarding the statis- 
tical concepts of confidence and signi- 
ficance (2, 7,9). Although this confu- 
sion can be partially explained as a 
semantic problem, it behooves the 
psychologist to examine these two 
concepts rather closely and to adopt 
pristine terminology for the benefit of 
beginning students and individuals of 
other disciplines that draw rather 
heavily upon the psychological litera- 
ture. 


CONFIDENCE AND CONFIDENCE 
COEFFICIENTS 


Confidence, a concept customarily 
reserved for discussions of interval 
estimation, is the faith which one is 
willing to place in a statement that 
an interval established by a sampling 
process actually contains or bounds 
a parameter of interest. One gen- 
erally expresses this faith statistically 
by affixing to each interval a confi- 
dence coefficient, or confidence proba- 
bility, which can be written as 1—e, 
where €= p/100 for 0<p<100, and 
p is usually taken to be a very small 
number (1, 3, 6, 8). For example, if 
p=5 the confidence coefficient would 
be .95, and one would refer to the in- 
terval with which this coefficient is 
associated as the 95% confidence 
interval. “ij @2 

The confidence coefficient is fre- 
quently interpreted in the following 
manner: If one were to draw samples 
of size K from a population of N ele- 
ments (K naturally being <\) and 


from each sample establish a 95% 
confidence interval on some specified 
parameter of the population, then in 
the long run about 95% of the to- 
tality of these intervals would ac- 
tually contain the parameter of in- 
terest, and approximately 100 €%, or 
5%, of them would not (3). This in- 
terpretation is correct, but of course 
assumes (%) to be a rather large num- 
ber.' 


SIGNIFICANCE AND SIGNIFICANCE 
LEVELS 


Significance, as contrasted to con- 
fidence, is given to the testing of hy- 
potheses.. Here one makes a state- 
ment, i.e., states an hypothesis, which 
will hereafter in this discussion be 
represented as H, that may be either 
true or false and then takes action on 
this H by accepting or rejecting it. 
Clearly, any one of the following ac- 
tions is a likely outcome as a result of 
testing an H: (a) rejection of a false 
H; (6) acceptance of a true H; (c) re- 
jection of a true H; or (d) acceptance 
of a false H. It is quite evident that 
actions (a) and (b) are desirable, 
while (c) and (d) carry the connota- 
tion of committing an error—c being 
the familiar Type I error or an error 
of the first kind, while d is called a 
Type II error or an error of the sec- 
ond kind (4, 8). 

When one tests an H, the proba- 
bility that he will take action cis de- 
fined as the significance level, which 
we will represent as a (8). Although 
a is generally of the same order of 

' The notation (%) is used here as a com- 


binational symbol to indicate the number of 
ways that K objects can be selected from N. 
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magnitude as €, a and e¢ differ in the 
amount of information which they 
convey, for while € completely tells 
all there is to know about “being 
wrong”’ in interval estimation, @ only 
gives information about a very par- 
ticular type of error, i.e. the action de- 
scribed by c. To emphasize the con- 
trast made here between @ and €, one 
merely needs to examine the other 
type of error that can be made in the 
test of an H. 

For this purpose, let ( represent 
the probability that action d is taken, 
i.e. a Type II error is committed; 
then, by definition, 1—f is known as 
the power of the statistical test or the 
probability that action @ will occur 
(4,8). Although texts in psychologi- 
cal statistics do not seem to place a 
great deal of emphasis upon the 
power of a test, power is the basic 
concept responsible for one’s employ- 
ing statistical tests as a basis for tak- 
ing action on an H. If this were not 


so, to test an H at the 5% level of sig- 
nificance, one could simply draw from 


a box of 100 beads—95 white and 5 
red—a bead at random and adopt the 
convention that he would reject the 
H whenever a red bead appeared. 
With such a test, one can readily see 
that not only a but also 1—f always 
equals .05, or 8=.95. It is this large 
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value of 8 that precludes one’s em- 
ploying the bead-box test. For an ex- 
cellent discussion of 8 and its relation 
to the alternative H against which 
one might be testing, the reader is re- 
ferred to Dixon and Massey (4, pp. 
244-261). 


SUMMARY AND DISCUSSION 


The admixing of the concepts of 
confidence and significance has be- 
come so prevalent in the psychologi- 
cal literature that one typically reads 
statements, in the reports of psycho- 
logical research, indicating that cer- 
tain experimental results were signifi- 
cant at, say, the 5% “‘level of confi- 
dence.” 

It may be that this confusion arises 
from the fact that one can utilize a 
confidence interval as a significance 
test (e.g., see 5, p. 241), and in doing 
so-may hastily, but incorrectly, con- 
clude that there is no difference be- 
tween the two concepts. 

Inasmuch as explicit terminology 
is needed to convey the probabilities 
of committing statistical errors in the 
respective areas of interval estima- 
tion and testing of hypotheses, the 
concept of confidence should never be 
associated with the statistical test of 
an H regardless of the nature of the 
test being employed. 
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